Abstract

We systematically study dijet production in various processes in the small-$x$ limit and establish
an effective $k_t$-factorization for hard processes in a system with dilute probes scattering on a dense
target. We find that the well-known Weizsäcker-Williams gluon distribution can be directly probed
in the quark-antiquark jet correlation in deep inelastic scattering and the dipole gluon distribution
can be directly measured in the direct photon-jet correlation in pA collisions. In the large-$N_c$ limit,
the unintegrated gluon distributions involved in other different dijet channels in pA collisions are
shown to be related to two widely proposed ones: the Weizsäcker-Williams gluon distribution and
the dipole gluon distribution.
I. INTRODUCTION



Factorization is part of the foundations of high-energy hadronic physics, as it provides the 
key ingredient for the phenomenological studies of high-energy experiments. Factorization 
theorems make the separation between short-distance perturbative physics and long-distance 
nonperturbative effects possible. Thus, cross sections measured in high-energy experiments 
can be factorized into products of hard parts (short-distance physics) and parton distri- 
butions (nonperturbative physics). In addition, an essential part of factorization is the 
universality of the parton distributions, among different processes. 

While collinear factorization has been the most widely used framework in phenomenolog- 
ical studies, and remains a sufficiently good approximation of QCD for the most inclusive 
processes in hadronic collisions, the investigation of less inclusive observables showed the 
need for a transverse-momentum dependent (TMD) factorization. During the last decade, 
a large amount of work has been devoted to establishing such a framework in QCD. However,
recent progress [3-0] has shown that TMD factorization is violated for dijet production in 
hadron-hadron (e.g., pp) collisions, due to a loss of universality. 

In this paper, we propose a solution to this problem in the small-$x$ limit: we establish
an effective TMD factorization for hard processes in collisions of dilute probes
off dense hadrons (or large nuclei)$^1$. We confirm that TMD parton distributions are not
universal, but we show that at small $x$ they can be constructed from several universal
individual building blocks. This is achieved by working with an appropriate approximation of
QCD in the small-$x$ limit, where large parton densities and non-linear saturation
effects are crucial.

Saturation phenomena in high-energy collisions have attracted great attention in the
last two decades. At very high energies, corresponding to the low-$x$ regime, parton
distributions reach very high densities and non-linear effects become important in describing the
dynamics of the hadronic system [8-11]. The transition to the saturation regime is
characterized by the saturation scale, which is interpreted as the typical transverse momentum
of the small-$x$ partons, and is also related to the transverse color charge density in the
infinite momentum frame of the dense target. It has been argued [10] that the high density
of gluons inside a hadron or nucleus allows for a semiclassical treatment of the color field,
leading to the Color Glass Condensate (CGC) effective description of the small-$x$ part of
the hadronic/nuclear wave function, which has been widely used to systematically study
saturation physics.

Experimental data are still not conclusive on this matter, but strong evidence of these
effects has been found in the deep inelastic scattering (DIS) experiments at HERA and
deuteron-gold collisions at RHIC. It is expected that saturation physics will play an
important role in explaining the results of the ongoing measurements of single-inclusive
production and two-particle correlations in the forward region at RHIC, as well as future
heavy-ion experiments at the LHC. In addition, the planned Electron-Ion Collider [12] will be
able to provide ideal experimental conditions to study the low-$x$ parton distributions and
thus test saturation physics in both protons and large nuclei.

In saturation physics, two different unintegrated gluon distributions (UGDs) have been 

1 Note that this effective TMD factorization does not hold for high energy pp and AA collisions due to 
final state soft gluon exchanges from both the projectile and the target to the hard part. See Ref. @ for 
detailed discussion. For the case of dilute projectiles scattering on a dense target, we can always neglect 
the soft gluon exchanges from the dilute projectile to the hard part while we resum all the soft gluon 
exchanges attaching the dense target to the hard part, since the gluon field is much stronger in the dense
target.




FIG. 1. Schematic diagrams for two-particle production in a dilute system scattering on a dense 
target with multiple scattering. The transverse-momentum imbalance between the two particles
can be used to probe the unintegrated gluon distribution of the dense target.

widely used in the literature. The first gluon distribution, which is known as the
Weizsäcker-Williams (WW) gluon distribution, is calculated from the correlator of two classical gluon
fields of relativistic hadrons (non-abelian Weizsäcker-Williams fields) [10, 13]. The WW
gluon distribution has a clear physical interpretation as the number density of gluons inside
the hadron in light-cone gauge, but is not used to compute cross sections. On the other hand,
the second gluon distribution, which is defined as the Fourier transform of the color dipole
cross section, does not have a clear partonic interpretation, but it is the one appearing
in most of the $k_t$-factorized formulae found in the literature for single-inclusive particle
production in pA collisions [11].

It has been a long-standing question what is fundamentally different between these two gluon
distributions, and whether there is any observable sensitive to the WW distribution [14].
The objective of this paper is to answer these questions and show that these two gluon
distributions are the fundamental building blocks of all the TMD gluon distributions at
small $x$. Eventually, this leads us to an effective TMD factorization for dijet production in
the collision of a dilute probe with a dense target. We find that, in the small momentum
imbalance limit described below, the dijet production process in DIS can provide direct
measurements of the WW gluon distribution, and the photon-jet correlation measurement
in pA collisions can access the dipole gluon distribution directly. In addition, other more
complicated dijet production processes in pA collisions will involve both of these gluon
distributions through convolution in transverse momentum space, when the large-$N_c$ limit
is taken.



A short summary of our study has been published in Ref. [15]. Here we present the
detailed derivations, and the precise equivalence between the TMD and CGC approaches
in their overlapping domain of validity, i.e., to leading power of the hard scale and in the
small-$x$ limit. In general, the TMD factorization is valid for any $x$ but is a leading-twist
approach, while the CGC is applicable only at small $x$ but contains all the power
corrections. Since the main objective of this paper is to understand dijet production processes
theoretically, we leave the phenomenological studies to future work.

We focus on two-particle production (or dijet production at higher energy) in the
case of a dilute system scattering on a dense target, as illustrated in Fig. 1:

$$B + A \to H_1(k_1) + H_2(k_2) + X , \tag{1}$$

where A represents the dense target (we shall call it a nucleus in the following), B stands for
the dilute projectile (such as a photon or a high-$x$ parton in a hadron), and $H_1$ and $H_2$ are the
two final state particles with momenta $k_1$ and $k_2$, respectively. Let us denote as $x_p$ the light-cone
momentum fraction of the parton (or virtual photon) from the incoming projectile B, and
as $x_g \ll 1$ the momentum fraction of the gluon from the incoming target. We are interested
in the kinematic region where the transverse momentum imbalance between the outgoing
particles is much smaller than their individual momenta:
$q_\perp = |k_{1\perp} + k_{2\perp}| \ll |k_{1\perp}| \simeq |k_{2\perp}| \simeq P_\perp$,
where $P_\perp$ is defined as $(k_{1\perp} - k_{2\perp})/2$. This is referred to as the back-to-back correlation
limit (the correlation limit) in the following discussions. An important advantage of taking
this limit is that we can apply the power counting method to obtain the leading order
contribution in $q_\perp/P_\perp$, where the differential cross section directly depends on the UGDs of
the nuclei.
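
As a minimal numerical sketch of the correlation limit defined above, the snippet below computes $q_\perp$ and $P_\perp$ from two transverse momenta and checks the ratio $q_\perp/P_\perp$. The function name `correlation_limit_variables`, the sample momenta, and the cutoff 0.2 are illustrative assumptions and are not taken from the text.

```python
import math

def correlation_limit_variables(k1, k2):
    """Return (q_perp, P_perp) for two transverse momenta given as (x, y) tuples."""
    # q_perp = |k1 + k2|: transverse momentum imbalance of the pair
    q_perp = math.hypot(k1[0] + k2[0], k1[1] + k2[1])
    # P_perp = |k1 - k2| / 2: hard relative transverse momentum
    P_perp = math.hypot(k1[0] - k2[0], k1[1] - k2[1]) / 2.0
    return q_perp, P_perp

# Hypothetical, nearly back-to-back pair (GeV); values chosen only for illustration
k1 = (10.0, 0.5)
k2 = (-9.5, 0.5)

q_perp, P_perp = correlation_limit_variables(k1, k2)
ratio = q_perp / P_perp

# Illustrative cutoff for "q_perp << P_perp"; the text does not fix a number
in_correlation_limit = ratio < 0.2

print(f"q_perp = {q_perp:.3f} GeV, P_perp = {P_perp:.3f} GeV, ratio = {ratio:.3f}")
```

The smaller the ratio, the better the leading-power expansion in $q_\perp/P_\perp$ is expected to work.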

For each individual dijet production process, we employ two independent approaches,
namely the TMD approach and the CGC approach$^2$. The TMD approach is straightforward
and clear in terms of factorization. On the other hand, the CGC approach is commonly
used in dealing with small-$x$ calculations. It allows one to go beyond the correlation limit,
which gives a deeper access to the QCD dynamics at small $x$, but this is not the purpose
of this paper. In this more general situation, cross sections involve multi-gluon distribution
functions, as expected due to parton saturation and multiple scatterings, and therefore there
is no $k_t$-factorization. Except for the most inclusive observables (such as inclusive and
semi-inclusive DIS, single-gluon and valence quark production in pA collisions), $k_t$-factorization
is only a property of the linear BFKL regime. However, taking the correlation limit allows
one to simplify the dijet production results of the CGC, and to obtain an effective factorization
which coincides with that found in the TMD approach.

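The Weizsäcker-Williams gluon distribution can be defined following the conventional
gluon distribution [17, 18]

$$xG^{(1)}(x,k_\perp) = \int \frac{d\xi^- d^2\xi_\perp}{(2\pi)^3 P^+}\, e^{ixP^+\xi^- - ik_\perp\cdot\xi_\perp}\, \langle P|F^{+i}(\xi^-,\xi_\perp)\,\mathcal{L}_\xi^\dagger \mathcal{L}_0\, F^{+i}(0)|P\rangle , \tag{2}$$

where $F^{\mu\nu}$ is the gauge field strength tensor, $F^a_{\mu\nu} = \partial_\mu A^a_\nu - \partial_\nu A^a_\mu - g f_{abc} A^b_\mu A^c_\nu$, with $f_{abc}$ the
antisymmetric structure constants for SU(3), and

$$\mathcal{L}_\xi = \mathcal{P}\exp\left\{-ig\int_{\xi^-}^{\infty} d\zeta^- A^+(\zeta^-,\xi_\perp)\right\}\, \mathcal{P}\exp\left\{-ig\int_{\xi_\perp}^{\infty} d\zeta_\perp\cdot A_\perp(\zeta^-=\infty,\zeta_\perp)\right\}$$

is the gauge link in the adjoint representation, $A^\mu = A^\mu_a t_a$ with $t_a = -if_{abc}$. It contains
a transverse gauge link at spatial infinity which is important to make the definition gauge
invariant [20]. These gauge links have to be made non-light-like to regulate the light-cone
singularities when gluon radiation contributions are taken into account [17]. In the above
definition, we assume that the hadron is moving along the +z direction. The light-cone
momenta $P^\pm$ are defined as $P^\pm = (P^0 \pm P^z)/\sqrt{2}$. This gluon distribution can also be
defined in the fundamental representation [2],

$$xG^{(1)}(x,k_\perp) = 2\int \frac{d\xi^- d^2\xi_\perp}{(2\pi)^3 P^+}\, e^{ixP^+\xi^- - ik_\perp\cdot\xi_\perp}\, \langle P|\mathrm{Tr}\left[F^{+i}(\xi^-,\xi_\perp)\,\mathcal{U}^{[+]\dagger} F^{+i}(0)\,\mathcal{U}^{[+]}\right]|P\rangle , \tag{3}$$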



2 Our formulation of the CGC approach leads to similar intermediate steps as in Ref. [16]. However, we
treat the n-point functions differently by using Wilson lines.
where the gauge link $\mathcal{U}^{[+]}_\xi = U^n[0,+\infty;0]\,U^n[+\infty,\xi^-;\xi_\perp]$, with $U^n$ being reduced to the
light-like Wilson line in covariant gauge. It is straightforward to see that $\mathcal{U}^{[+]}$ represents the
final state interactions according to its future integration path to $+\infty$.

By choosing the light-cone gauge with a certain boundary condition for the gauge potential
($A_\perp(\zeta^- = \infty) = 0$ for the specific case above), we can drop the gauge link contribution in
Eqs. (2) and (3) and find that this gluon distribution has the number density interpretation.
Then, it can be calculated from the wave functions or the WW field of the nucleus target
[10, 13]. Within the CGC framework, this distribution can be written in terms of the
correlator of four Wilson lines as (see Section II B)



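$$xG^{(1)}(x,k_\perp) = -\frac{2}{\alpha_S}\int\frac{d^2v}{(2\pi)^2}\frac{d^2v'}{(2\pi)^2}\,e^{-ik_\perp\cdot(v-v')}\left\langle \mathrm{Tr}\left[\partial_iU(v)\right]U^\dagger(v')\left[\partial_iU(v')\right]U^\dagger(v)\right\rangle_{x_g} , \tag{4}$$

where the Wilson line $U(x_\perp)$ is defined as $U^n[-\infty,+\infty;x_\perp]$. At small $x$ for a large nucleus,
this distribution can be evaluated using the McLerran-Venugopalan model [10]$^3$

$$xG^{(1)}(x,k_\perp) = \frac{S_\perp}{\pi^2\alpha_s}\frac{N_c^2-1}{N_c}\int\frac{d^2r_\perp}{(2\pi)^2}\,\frac{e^{-ik_\perp\cdot r_\perp}}{r_\perp^2}\left(1 - e^{-\frac{r_\perp^2Q_s^2}{4}}\right) , \tag{5}$$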

where $N_c = 3$ is the number of colors, $S_\perp$ is the transverse area of the target nucleus, and
$Q_s^2 \propto \ln\frac{1}{r_\perp^2\lambda^2}\int dx^-\,\mu^2(x^-)$ is the gluon saturation scale [11], with $\mu^2$ the color charge
density in a large nucleus. We have cross checked this result by directly calculating the
gluon distribution function in Eq. (2), following the similar calculation for the quark in
Refs. [20, 21]. The derivation of the WW gluon distribution from its operator definition is
provided in Appendix A 1.

The second gluon distribution, the Fourier transform of the dipole cross section, is defined
in the fundamental representation$^4$

$$xG^{(2)}(x,k_\perp) = 2\int\frac{d\xi^-d^2\xi_\perp}{(2\pi)^3P^+}\,e^{ixP^+\xi^- - ik_\perp\cdot\xi_\perp}\,\langle P|\mathrm{Tr}\left[F^{+i}(\xi^-,\xi_\perp)\,\mathcal{U}^{[-]\dagger}F^{+i}(0)\,\mathcal{U}^{[+]}\right]|P\rangle , \tag{6}$$

where the gauge link $\mathcal{U}^{[-]}_\xi = U^n[0,-\infty;0]\,U^n[-\infty,\xi^-;\xi_\perp]$ stands for initial state
interactions. Thus, the dipole gluon distribution contains both initial and final state interactions
in the definition.

$\mathcal{U}^{[+]}$ and $\mathcal{U}^{[-]}$ are the gauge links which appear in the quark distributions in the DIS and
Drell-Yan processes, respectively. It is well known that there is only a final state effect in
DIS, while there is only an initial state interaction in the Drell-Yan process. In addition, in
processes involving gluons and more complicated partonic structures, more complex gauge
links may appear, such as combinations of $\mathcal{U}^{[+]}$ and $\mathcal{U}^{[-]}$ [2]. We will see this in our following
calculations, especially in dijet production in pA collisions.

For the second gluon distribution $xG^{(2)}$ as shown in Eq. (6), the gauge link contribution
cannot be completely eliminated. In other words, there is no number density interpretation
for this gluon distribution. This is also because it contains both initial and final state



3 To obtain this result, it was assumed that the color charge densities in the nucleus obey a Gaussian
distribution with variance $\mu^2$. It was recently argued that this assumption is inconsistent with the QCD
non-linear evolution [19], except for two-point functions.

4 The Fourier transform of the dipole cross section in the adjoint representation is also commonly used,
as it enters single gluon production in pA collisions [22-24]. In the large-$N_c$ limit, it is related to the
convolution of two $xG^{(2)}$.
interaction effects. Due to the gauge link in this gluon distribution running from $-\infty$ to $+\infty$,
this gluon distribution can naturally be related to the color-dipole cross section evaluated
from a dipole of size $r_\perp$ scattering on the nucleus target, and has been calculated in the
CGC formalism, 

The derivation of this dipole gluon distribution from its operator definition is provided in
Appendix A 2.

These two gluon distributions have been intensively investigated in the last few years$^5$.
In particular, it was found that they

• have the same perturbative behavior: they both scale as $Q_s^2/q_\perp^2$ at large transverse
momentum $q_\perp \gg Q_s$;

• differ dramatically at small transverse momentum, however: $G^{(1)} \sim \ln(Q_s^2/q_\perp^2)$,
whereas $G^{(2)} \sim q_\perp^2$.

It will be very important to test these predictions by measuring the quark-antiquark corre- 
lation in DIS process and direct photon jet correlation in pA collisions, since these processes 
can directly probe these two gluon distributions separately. 

The second gluon distribution ($xG^{(2)}$) depends on the dipole cross section, which appears
in various inclusive and semi-inclusive processes. For example, the total cross section (or the
structure functions) in DIS, the single inclusive hadron production in DIS and pA collisions,
and the Drell-Yan lepton pair production in pA collisions all depend on this dipole
gluon distribution. Extensive phenomenological analyses have been performed to constrain
this gluon distribution from the experimental data.

On the other hand, the first gluon distribution ($xG^{(1)}$) appears in only a few physical
processes. Thus, we do not have many constraints on its behavior. The only knowledge
comes from model calculations (e.g., the GBW model [25], which provides a good description
of all DIS data below $x = 0.01$). Therefore, it is crucial to carry out experimental
observations of the quark-antiquark jet correlation in DIS at the planned Electron-Ion
Collider, which shall provide very important information on this gluon distribution.

Two-particle production in pA collisions is found to depend on both gluon
distributions [15]. In Table I, we summarize the current status for the two gluon distributions
probed in high energy processes, where we find that the dipole gluon distribution
contributes to most of them, such as inclusive DIS, semi-inclusive DIS (SIDIS) [26], Drell-Yan
(DY) processes, single inclusive hadron production in pA collisions, photon-jet correlations
and dijet production in pA collisions, whereas the WW gluon distribution only appears in the
quark-antiquark dijet correlation in DIS and dijet correlations in pA collisions. It is important
to note that our derivations for the two basic processes, where the two distributions can
be measured independently (dijet correlations in DIS and photon-jet correlation in pA
collisions), are exact for finite $N_c$. The large-$N_c$ limit is only necessary for more complicated

5 It has been observed that these two UGDs can be related through a mathematical formula,
$xG^{(2)}_{\mathrm{adj}}(x,q_\perp) \propto q_\perp^2\nabla^2_{q_\perp}xG^{(1)}(x,q_\perp)$, where $xG^{(2)}_{\mathrm{adj}}(x,q_\perp)$ stands for the gluon distribution in the adjoint
representation, which is derived from a dipole formed by two gluons (e.g., see Ref. [14]). However, we believe
that this relation is just a mathematical observation without any physical derivation. In addition, we find
that it only works for the MV model, which assumes the local Gaussian approximation. This mathematical
relation is invalidated beyond the local Gaussian approximation (e.g., see Appendix A 1). From the above
operator definitions of these two UGDs, we can see that they are two independent gluon distributions.
                      DIS and DY   SIDIS   hadron in pA   photon-jet in pA   Dijet in DIS   Dijet in pA
$G^{(1)}$ (WW)            ×          ×          ×                ×                 √              √
$G^{(2)}$ (dipole)        √          √          √                √                 ×              √

TABLE I. The involvement of these two gluon distributions in high energy processes.



processes where it allows us to write the new distributions as convolutions of the two basic 
ones. 

In the following sections, we will carry out the detailed derivations for the two-particle
correlations in these processes. The quark-antiquark correlation in DIS will be calculated
in Sec. II. Sec. III will be devoted to the direct photon-jet correlation in pA collisions. We
will derive the formalism for dijet correlations in pA collisions in Sec. IV. A summary and
further discussions will be given in Sec. V. In all these calculations, we will show the results
from both the TMD approach and the CGC calculations, and we will
demonstrate that they are consistent in the correlation limit.



II. DIJET PRODUCTION IN DIS 



Despite the nice physical interpretation, it has been argued that the gluon distribution
in Eq. (2) is not directly related to physical observables in the CGC formalism. However,
we will show that $xG^{(1)}$ can be directly probed through the quark-antiquark jet correlation
in DIS,

$$\gamma^* A \to q(k_1) + \bar q(k_2) + X , \tag{8}$$

where the incoming (virtual) photon carries momentum $k_{\gamma^*}$, the target nucleus has
momentum $P_A$, and the final state quark and antiquark have momenta $k_1$ and $k_2$, respectively.
Again, we focus on the kinematic region with the correlation limit: $q_\perp = |k_{1\perp} + k_{2\perp}| \ll P_\perp$.
The transverse momenta are defined in the center of mass frame of the virtual photon $\gamma^*$
and the nucleus A. The calculations are performed for $Q^2$ of the same order as $P_\perp^2$. As
discussed above, we take the leading order contribution in the correlation limit,
$q_\perp \ll P_\perp$, and neglect all higher order corrections. We plot the typical Feynman diagrams
for the process (8) in Fig. 2, where the bubble in the partonic part represents the hard
interaction vertex including gluon attachments to both quark and antiquark lines. Fig. 2(a)
is the leading Born diagram, whose contributions can be associated with the hard partonic
cross section times the gluon distribution from Eq. (3). In high energy scattering with
the nucleus target, additional gluon attachments are important and we have to resum these
contributions in the large nuclear number limit. Figs. 2(b,c) represent the diagrams
contributing at two-gluon exchange order, where the second gluon can attach to either the quark
line or the antiquark line. By applying the power counting method in the correlation limit
($q_\perp \ll P_\perp$), we can simplify the scattering amplitudes with the eikonal approximation.
For example, Fig. 2(b) can be reduced to:

$$\frac{g}{-q_2^+ + i\epsilon}\,T^b\,\Gamma^a , \tag{9}$$



where $q_2$ is the gluon momentum, $T^b$ is the SU(3) color matrix in the fundamental
representation and $\Gamma^a$ represents the rest of the partonic scattering amplitude with color indices
FIG. 2. Typical Feynman diagrams contributing to the quark-antiquark jet correlation in deep
inelastic scattering: (a) leading order, where the bubble represents the gluon attachments to both
quark lines; (b,c) two-gluon exchange diagrams.



for the two gluons $a$ and $b$. Similarly, Fig. 2(c) can be reduced to:

$$-\frac{g}{-q_2^+ + i\epsilon}\,\Gamma^a\,T^b . \tag{10}$$

The sum of these two diagrams will be $\frac{g}{-q_2^+ + i\epsilon}\left[T^b\Gamma^a - \Gamma^aT^b\right]$. Because of the unique
color index in $\Gamma^a$, we find the effective vertex as

$$\text{Fig. 2(b,c)} \sim \frac{i}{-q_2^+ + i\epsilon}\,(-ig)(-if_{bca})\,T^c , \tag{11}$$

which corresponds to the first order expansion of the gauge link contribution in the gluon
distribution defined in Eq. (2). For all higher order contributions, we can follow the procedure
outlined in Ref. [!, 20[ to derive the gluon distribution. 
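
The step from the sum of the two diagrams to the effective vertex in Eq. (11) can be made explicit with the SU(3) algebra $[T^a, T^b] = if_{abc}T^c$. The sketch below assumes, as the phrase "unique color index" suggests, that the remaining amplitude carries only the color matrix of the first gluon, $\Gamma^a = T^a\,\tilde\Gamma$; the color-blind factor $\tilde\Gamma$ is notation introduced here purely for illustration.

```latex
\begin{align}
\frac{g}{-q_2^+ + i\epsilon}\left[T^b\Gamma^a - \Gamma^a T^b\right]
  &= \frac{g}{-q_2^+ + i\epsilon}\,[T^b, T^a]\,\tilde\Gamma
   = \frac{g}{-q_2^+ + i\epsilon}\,\bigl(i f_{bac}\bigr)\,T^c\,\tilde\Gamma \\
  &= \frac{i}{-q_2^+ + i\epsilon}\,(-ig)\,(-i f_{bca})\,T^c\,\tilde\Gamma ,
  \qquad f_{bac} = -f_{bca} .
\end{align}
```

Under this assumption the factor $(-if_{bca})$ is an adjoint-representation generator, and in an Abelian theory the commutator vanishes, so the two attachments cancel.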



In particular, we calculate the differential cross section contributions from the diagrams
of Fig. 2, assuming a generic coupling between the exchanged gluons and the nucleus
target. The contributions at a given order can be reproduced by the hard partonic cross
section (given below) multiplied by the TMD gluon distribution defined in Eq. (3) at the
same order from similar diagrams. This method is particularly useful for identifying the gluon
distributions involved in hard scattering processes and will be applied throughout the
following calculations.

Of course, to build a rigorous TMD factorization theorem for this process, we have to go
beyond the diagrams shown in Fig. 2 and include the real gluon radiation contributions [17,
18]. These diagrams will introduce large logarithms of the form $\ln(P_\perp/q_\perp)$, in addition to the
small-$x$ logarithms $\ln(1/x)$. The combination of both effects has not yet been systematically
studied in the literature. We hope to address this issue in the future. Moreover, there have
been discussions on the power counting method to factorize the gluon distribution from any
generic Feynman diagrams, where one has to be extra cautious about the
"super-leading-power" contributions (see, for example, Ref. [27]).



From our analysis, we identify the gluon distribution involved in the quark-antiquark jet
correlation in DIS as the first gluon distribution at small $x$. We want to emphasize
that this result is also a unique consequence of the non-abelian nature of QCD. For an Abelian
theory, we can easily find that the final state interactions between the quark and antiquark
with the nucleus target cancel out completely. Therefore, there are no final state interaction
effects in the analogous QED process.

Furthermore, the gauge link $\mathcal{U}^{[+]\dagger}$ in Eq. (3) can be viewed as the sum of all the final state
interactions between the nucleus target and the produced quark, as shown in Fig. 2(b). In the
meantime, the gauge link $\mathcal{U}^{[+]}$ in Eq. (3) takes care of the final state interactions between the
nucleus target and the produced antiquark, as illustrated in Fig. 2(c). Therefore, following
Ref. it is straightforward to show that the Weizsacker- Williams gluon distribution is 
the relevant gluon distribution in DIS dijet since it correctly resums all the final state 
interactions. 



A. TMD-factorization approach to the DIS dijet production 

By putting in the hard partonic cross section $H_{\gamma^*g\to q\bar q}$ and especially the correct gluon
distribution, namely the WW gluon distribution, which resums all the final state
interactions between the $q\bar q$ pair and the target nucleus, we obtain the following transverse and
longitudinal differential cross sections for the quark-antiquark jet correlation in DIS:

$$\frac{d\sigma^{\gamma^*_TA\to q\bar q+X}_{\rm TMD}}{d\text{P.S.}} = \delta(x_{\gamma^*}-1)\,x_gG^{(1)}(x_g,q_\perp)\,H_{\gamma^*_Tg\to q\bar q} , \tag{12}$$

$$\frac{d\sigma^{\gamma^*_LA\to q\bar q+X}_{\rm TMD}}{d\text{P.S.}} = \delta(x_{\gamma^*}-1)\,x_gG^{(1)}(x_g,q_\perp)\,H_{\gamma^*_Lg\to q\bar q} , \tag{13}$$

where $x_g$ is the momentum fraction of hadron A carried by the gluon and is determined by
the kinematics, and $x_{\gamma^*} = z_q + z_{\bar q}$ with $z_q = z$ and $z_{\bar q} = 1 - z$ being the momentum fractions of
the virtual photon carried by the quark and antiquark, respectively. The phase space factor
is defined as $d\text{P.S.} = dy_1dy_2d^2P_\perp d^2q_\perp$, and $y_1$ and $y_2$ are rapidities of the two outgoing
particles in the lab frame. In terms of the rapidities and the center of mass energy $\sqrt{s}$, one
can find

$$z = \frac{|k_{1\perp}|e^{y_1}}{|k_{1\perp}|e^{y_1} + |k_{2\perp}|e^{y_2}} , \qquad x_g = \frac{|k_{1\perp}|e^{-y_1} + |k_{2\perp}|e^{-y_2}}{\sqrt{s}} . \tag{14}$$
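
The rapidity relation for $x_g$ can be illustrated numerically. The sketch below assumes standard light-cone kinematics for massless final-state particles, with rapidities counted as positive in the direction of the projectile, so that forward production corresponds to small $x_g$; the function name `momentum_fractions` and the sample values are hypothetical.

```python
import math

def momentum_fractions(k1, y1, k2, y2, sqrt_s):
    """Return (x_g, z) for |k_perp| values k1, k2 (GeV) and rapidities y1, y2.

    Assumes massless particles: k^+ ~ |k_perp| e^{+y}, k^- ~ |k_perp| e^{-y}.
    """
    plus = k1 * math.exp(y1) + k2 * math.exp(y2)      # total "plus" momentum of the pair
    minus = k1 * math.exp(-y1) + k2 * math.exp(-y2)   # total "minus" momentum of the pair
    z = k1 * math.exp(y1) / plus   # share of the projectile momentum carried by particle 1
    x_g = minus / sqrt_s           # target gluon momentum fraction
    return x_g, z

# Hypothetical forward dijet: two 5 GeV jets at rapidity 3, sqrt(s) = 200 GeV
x_g, z = momentum_fractions(5.0, 3.0, 5.0, 3.0, 200.0)
print(f"x_g = {x_g:.5f}, z = {z:.2f}")
```

For symmetric jets the momentum sharing is $z = 1/2$, and moving both jets further forward lowers $x_g$ exponentially in rapidity.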



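In addition, in the correlation limit, one has $|P_\perp| \simeq |k_{1\perp}| \simeq |k_{2\perp}| \gg |q_\perp| = |k_{1\perp} + k_{2\perp}|$. The
leading order hard partonic cross sections read

$$H_{\gamma^*_Tg\to q\bar q} = \alpha_s\alpha_{em}e_q^2\,\frac{\hat s^2 + Q^4}{(\hat s + Q^2)^4}\left(\frac{\hat u}{\hat t} + \frac{\hat t}{\hat u}\right) , \tag{15}$$

$$H_{\gamma^*_Lg\to q\bar q} = \alpha_s\alpha_{em}e_q^2\,\frac{8\hat sQ^2}{(\hat s + Q^2)^4} , \tag{16}$$

with the usually defined partonic Mandelstam variables $\hat s = (k_1 + k_2)^2 = P_\perp^2/(z(1-z))$,
$\hat t = (k_2 - k_{\gamma^*})^2 = -(P_\perp^2 + \epsilon_f^2)/(1-z)$, and $\hat u = (k_1 - k_{\gamma^*})^2 = -(P_\perp^2 + \epsilon_f^2)/z$, with $\epsilon_f^2 = z(1-z)Q^2$
and $z = z_q$.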
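
A quick numerical consistency check of these kinematic variables is sketched below. It assumes the hard factors have the form $H_T \propto \frac{\hat s^2+Q^4}{(\hat s+Q^2)^4}\left(\frac{\hat u}{\hat t}+\frac{\hat t}{\hat u}\right)$ and $H_L \propto \frac{8\hat sQ^2}{(\hat s+Q^2)^4}$ with the common coupling factor $\alpha_s\alpha_{em}e_q^2$ stripped off; the function names and the input values are illustrative only.

```python
import math

def mandelstam(P2, z, Q2):
    """Partonic Mandelstam variables for given P_perp^2, z and Q^2 (massless quarks)."""
    eps2 = z * (1.0 - z) * Q2
    s_hat = P2 / (z * (1.0 - z))
    t_hat = -(P2 + eps2) / (1.0 - z)
    u_hat = -(P2 + eps2) / z
    return s_hat, t_hat, u_hat

def reduced_hard_factors(P2, z, Q2):
    """Transverse and longitudinal hard factors without alpha_s * alpha_em * e_q^2."""
    s_hat, t_hat, u_hat = mandelstam(P2, z, Q2)
    h_T = (s_hat**2 + Q2**2) / (s_hat + Q2)**4 * (u_hat / t_hat + t_hat / u_hat)
    h_L = 8.0 * s_hat * Q2 / (s_hat + Q2)**4
    return h_T, h_L

# Hypothetical kinematic point (GeV^2 for P2 and Q2)
P2, z, Q2 = 4.0, 0.3, 10.0
s_hat, t_hat, u_hat = mandelstam(P2, z, Q2)
h_T, h_L = reduced_hard_factors(P2, z, Q2)

# Closed forms in terms of z, P_perp^2 and eps_f^2, obtained by direct substitution
eps2 = z * (1.0 - z) * Q2
h_T_closed = z * (1 - z) * (z**2 + (1 - z)**2) * (P2**2 + eps2**2) / (P2 + eps2)**4
h_L_closed = 8.0 * z**2 * (1 - z)**2 * eps2 * P2 / (P2 + eps2)**4

print(s_hat + t_hat + u_hat, h_T, h_T_closed, h_L, h_L_closed)
```

The sum $\hat s + \hat t + \hat u$ equals $-Q^2$, as expected for a virtual photon and a massless gluon producing massless quarks, which is a useful guard against sign or factor slips when coding these expressions.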



Finally, in the correlation limit, one obtains the differential total cross section as follows:

$$\frac{d\sigma^{\gamma^*A\to q\bar qX}_{\rm tot}}{d\text{P.S.}} = \delta(x_{\gamma^*}-1)\,x_gG^{(1)}(x_g,q_\perp)\left[H_{\gamma^*_Tg\to q\bar q} + H_{\gamma^*_Lg\to q\bar q}\right] , \tag{17}$$

where $\sigma_{\rm tot}$ is defined as $\sigma_{\rm tot} = \sigma_T + \sigma_L$ and $x_gG^{(1)}$ is evaluated with the CGC result for the WW
gluon distribution in Eq. (5). By taking $Q^2 = 0$, we can extend the above result to the
FIG. 3. Typical diagrams contributing to the cross section in the deep inelastic process: amplitude
and conjugate amplitude, with $v = zx_1 + (1-z)x_2$ and $v' = zx_1' + (1-z)x_2'$.



case of dijet production in real photon scattering on nuclei. The longitudinal contribution
vanishes and the total cross section only contains the transverse part. Therefore, we obtain

$$\frac{d\sigma^{\gamma A\to q\bar qX}}{dy_1dy_2d^2P_\perp d^2q_\perp} = \delta(x_\gamma-1)\,x_gG^{(1)}(x_g,q_\perp)\,\alpha_s\alpha_{em}e_q^2\,\frac{z(1-z)\left[z^2 + (1-z)^2\right]}{P_\perp^4} , \tag{18}$$

with $x_gG^{(1)}$ given by the CGC result in Eq. (5). For the real photon case, there will be resolved
photon contributions, which should be taken care of separately, following the treatment of dijet
production in pA collisions discussed in Sec. IV below.



B. CGC approach to the DIS dijet production 

The quark-antiquark jet cross section can also be calculated in the CGC formalism. 
In this setup the photon splits into a quark-antiquark pair which subsequently undergoes 
multiple interactions with the nucleus (see Fig. 3). Previous calculations performed under



this framework [22, 28] have focused mainly on the total cross section or single inclusive gluon 
production, which are calculations involving a different color structure than the process we 
are interested in. Here we calculate the cross section in the most general case and then we 
show how the factorization formula is recovered in the correlation limit. 

At the amplitude level the process can be divided into two parts: the splitting wave 
function of the incoming photon and the multiple scattering factor. It is convenient to 
write these quantities in transverse coordinate space since in this basis, and in the eikonal 
approximation, the multiple interaction factor is diagonal. To be consistent with previous 
CGC calculations in the literature, we choose a frame in which the photon is moving along the
+z direction whereas the nuclear target is moving in the −z direction. When we compare the results
to those obtained in the last subsection, we have to keep in mind this difference. However,
we note that the differential cross section does not depend on the frame. For a right-moving
photon with longitudinal momentum $p^+$, no transverse momentum, and virtuality $Q^2$, the
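splitting wave function in transverse coordinate space takes the form

$$\psi^{T\lambda}_{\alpha\beta}(p^+,z,r) = 2\pi\sqrt{\frac{2}{p^+}}\begin{cases} i\epsilon_fK_1(\epsilon_f|r|)\,\dfrac{r\cdot\epsilon^{(1)}_\perp}{|r|}\left[\delta_{\alpha+}\delta_{\beta+}(1-z) + \delta_{\alpha-}\delta_{\beta-}z\right], & \lambda = 1, \\[2mm] i\epsilon_fK_1(\epsilon_f|r|)\,\dfrac{r\cdot\epsilon^{(2)}_\perp}{|r|}\left[\delta_{\alpha-}\delta_{\beta-}(1-z) + \delta_{\alpha+}\delta_{\beta+}z\right], & \lambda = 2, \end{cases} \tag{19}$$

$$\psi^{L}_{\alpha\beta}(p^+,z,r) = 2\pi\sqrt{\frac{4}{p^+}}\,z(1-z)\,Q\,K_0(\epsilon_f|r|)\,\delta_{\alpha\beta} , \tag{20}$$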

where again $z$ is the momentum fraction of the photon carried by the quark, $\lambda$ is the photon
polarization, $\alpha$ and $\beta$ are the quark and antiquark helicities, $r$ is the transverse separation of
the pair, $\epsilon_f^2 = z(1-z)Q^2$, and the quarks are assumed to be massless. The heavy quark
case will be considered in the next subsection.

The multiple scattering factor is expressed in terms of Wilson lines in the
fundamental representation. It can be shown [28] that this interaction term takes the form
$\left[U^\dagger(x_2)U(x_1) - 1\right]_{ij}$, where $x_1$ and $x_2$ are the transverse positions of the quark and the
antiquark, $i$ and $j$ are their color indices, and the Wilson line is given in terms of the background
field by

$$U(x) = \mathcal{P}\exp\left\{ig_S\int_{-\infty}^{+\infty}dx^+\,T^cA_c^-(x^+,x)\right\} . \tag{21}$$



The gauge field is directly related to the color charge density of the nucleus which will be 
averaged over the nuclear wave function at the level of the cross section. The way the color 
indices are contracted in the scattering factor is due to the fact that the pair is initially in 
a singlet state but no assumptions are made about the final state. The color indices i and 
j will be summed over independently also at the cross section level. 

With the pieces described above we can write down an explicit formula for the differen- 
tial cross section for dijet production. After averaging over the photon's polarization and 
summing over the quark and antiquark helicities and colors we obtain, 



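$$\frac{d\sigma^{\gamma^*_{T,L}A\to q\bar qX}}{d^3k_1d^3k_2} = N_c\alpha_{em}e_q^2\,\delta(p^+ - k_1^+ - k_2^+)\int\frac{d^2x_1}{(2\pi)^2}\frac{d^2x_1'}{(2\pi)^2}\frac{d^2x_2}{(2\pi)^2}\frac{d^2x_2'}{(2\pi)^2}\,e^{-ik_{1\perp}\cdot(x_1-x_1')}\,e^{-ik_{2\perp}\cdot(x_2-x_2')}$$
$$\times\sum_{\lambda\alpha\beta}\psi^{T,L\lambda}_{\alpha\beta}(x_1-x_2)\,\psi^{T,L\lambda*}_{\alpha\beta}(x_1'-x_2')\left[1 + S^{(4)}_{x_g}(x_1,x_2;x_2',x_1') - S^{(2)}_{x_g}(x_1,x_2) - S^{(2)}_{x_g}(x_2',x_1')\right] , \tag{22}$$

where the two- and four-point functions are defined as

$$S^{(2)}_{x_g}(x_1,x_2) = \frac{1}{N_c}\left\langle\mathrm{Tr}\,U(x_1)U^\dagger(x_2)\right\rangle_{x_g} , \tag{23}$$

$$S^{(4)}_{x_g}(x_1,x_2;x_2',x_1') = \frac{1}{N_c}\left\langle\mathrm{Tr}\,U(x_1)U^\dagger(x_1')U(x_2')U^\dagger(x_2)\right\rangle_{x_g} . \tag{24}$$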



The notation $\langle\ldots\rangle_{x_g}$ is used for the CGC average of the color charges over the nuclear
wave function, where $x_g$ is the smallest fraction of longitudinal momentum probed, and is
determined by the kinematics.

Notice that the transverse coordinates of the quark and antiquark in the amplitude
(unprimed coordinates) are different from the coordinates in the complex conjugate amplitude
(primed coordinates), since the two final momenta are not integrated over. This is a very
important feature of our calculation that, to our knowledge, does not appear in previous
CGC calculations of DIS in nuclei. It allows for a different color structure and in particular
it is responsible for the appearance of the 4-point function $S^{(4)}_{x_g}$, which cannot be expressed
in terms of 2-point functions, even in the large-$N_c$ limit (see Appendix B 2 for an explicit
evaluation of the medium average).

In order to compare with the TMD-factorization result discussed in the previous section, 
we need to consider the relevant kinematic region, in particular the correlation limit of 
Eq. (22). For convenience, we introduce the transverse coordinate variables $u = x_1 - x_2$ 
and $v = z x_1 + (1-z) x_2$, and similarly for the primed coordinates. The respective conjugate 
momenta are $\tilde P_\perp = (1-z) k_{1\perp} - z k_{2\perp} \simeq P_\perp$ and $q_\perp$, and therefore the correlation limit can 
be taken by assuming $u$ and $u'$ are small and then expanding the integrand with respect to 
these two variables before performing the Fourier transform. 
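
The way this change of variables separates the Fourier phases can be checked numerically. The sketch below is only illustrative: it assumes that the momentum conjugate to $v$ is $q_\perp = k_{1\perp} + k_{2\perp}$, and the function names and sample numbers are hypothetical.

```python
# Illustrative check (not taken from the paper): with u = x1 - x2 and
# v = z*x1 + (1 - z)*x2, the phase k1.x1 + k2.x2 should split into
# P_tilde.u + q.v, with P_tilde = (1 - z)*k1 - z*k2 and q = k1 + k2.
# The identification q = k1 + k2 is an assumption about the notation.

def dot(a, b):
    """Dot product of two 2D transverse vectors given as tuples."""
    return a[0] * b[0] + a[1] * b[1]

def dis_phase_split(k1, k2, x1, x2, z):
    """Return (k1.x1 + k2.x2, P_tilde.u + q.v) for the DIS dijet variables."""
    u = (x1[0] - x2[0], x1[1] - x2[1])
    v = (z * x1[0] + (1 - z) * x2[0], z * x1[1] + (1 - z) * x2[1])
    p_tilde = ((1 - z) * k1[0] - z * k2[0], (1 - z) * k1[1] - z * k2[1])
    q = (k1[0] + k2[0], k1[1] + k2[1])
    return dot(k1, x1) + dot(k2, x2), dot(p_tilde, u) + dot(q, v)

# Arbitrary sample values (hypothetical, arbitrary units).
original, split = dis_phase_split((1.3, -0.4), (-1.1, 0.7), (0.2, 0.5), (-0.3, 0.1), 0.35)
print(abs(original - split) < 1e-12)
```

Because the two phases agree for any input, small $u$ corresponds to large $\tilde P_\perp$, which is why expanding in $u$ and $u'$ implements the correlation limit.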

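Let us focus on the multiple scattering factor first. By using the following identities, 

$$ S^{(4)}_{x_g}(x_1,x_2;v',v') = S^{(2)}_{x_g}(x_1,x_2) , \tag{25} $$
$$ S^{(4)}_{x_g}(v,v;x_2',x_1') = S^{(2)}_{x_g}(x_2',x_1') , \tag{26} $$

it is easy to see that terms from the expansion of $S^{(4)}_{x_g}$ cancel the other terms in (22). After 
applying 

$$ U^\dagger(v) \left(\partial_i U(v)\right) = - \left(\partial_i U^\dagger(v)\right) U(v) , $$

we can show that the lowest order contribution in $u$ and $u'$ to the scattering factor can be 
written as 

$$ - u_i u'_j \frac{1}{N_c} \left\langle \mathrm{Tr} \left[\partial_i U(v)\right] U^\dagger(v') \left[\partial_j U(v')\right] U^\dagger(v) \right\rangle_{x_g} . \tag{27} $$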

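Taking into account the path ordering of the Wilson lines, we have the following formula for 
their derivatives, 

$$ \partial_i U(v) = i g_S \int_{-\infty}^{\infty} dv^+ \, U[-\infty, v^+; v] \left(\partial_i A^-(v^+, v)\right) U[v^+, \infty; v] , \tag{28} $$

where $U[a,b;x] = \mathcal{P} \exp\{ i g_S \int_a^b dx^+ \, T^c A_c^-(x^+, x) \}$. We notice that $(\partial_i A^-(v^+, v))$ is part of 
the gauge invariant field strength tensor $F^{i-}(\vec v)$ (see footnote 6). Therefore, up to the coupling 
factors from Eq. (28), the above correlator can be written in terms of the gauge invariant matrix element 

$$ \int_{-\infty}^{\infty} dv^+ \, dv'^+ \left\langle \mathrm{Tr} \left[ F^{i-}(\vec v)\, \mathcal{U}^{[+]\dagger} F^{j-}(\vec v')\, \mathcal{U}^{[+]} \right] \right\rangle_{x_g} . \tag{29} $$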

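Performing the $u$ and $u'$ integration in (22) after the expansion of the multiple scattering 
term, we find an explicit formula for the differential cross section in the desired kinematic 
region, 

$$ \frac{d\sigma^{\gamma^*_T A \to q\bar q X}}{d\mathcal{P.S.}} = \alpha_{em} e_q^2 \alpha_s\, \delta(x_{\gamma^*} - 1)\, z(1-z) \left( z^2 + (1-z)^2 \right) \frac{\epsilon_f^4 + P_\perp^4}{(P_\perp^2 + \epsilon_f^2)^4} \times (16\pi^3) \int \frac{d^3v\, d^3v'}{(2\pi)^6}\, e^{-i q_\perp \cdot (v - v')}\, 2 \left\langle \mathrm{Tr} \left[ F^{i-}(\vec v)\, \mathcal{U}^{[+]\dagger} F^{i-}(\vec v')\, \mathcal{U}^{[+]} \right] \right\rangle_{x_g} , \tag{30} $$
$$ \frac{d\sigma^{\gamma^*_L A \to q\bar q X}}{d\mathcal{P.S.}} = \alpha_{em} e_q^2 \alpha_s\, \delta(x_{\gamma^*} - 1)\, z^2(1-z)^2 \frac{8 \epsilon_f^2 P_\perp^2}{(P_\perp^2 + \epsilon_f^2)^4} \times (16\pi^3) \int \frac{d^3v\, d^3v'}{(2\pi)^6}\, e^{-i q_\perp \cdot (v - v')}\, 2 \left\langle \mathrm{Tr} \left[ F^{i-}(\vec v)\, \mathcal{U}^{[+]\dagger} F^{i-}(\vec v')\, \mathcal{U}^{[+]} \right] \right\rangle_{x_g} . \tag{31} $$

6 The other part of the field strength tensor comes from the transverse component of the Wilson lines, 
as the gauge invariance of QCD requires. When the $A^+ = 0$ gauge is used the only non-zero component 
of the gauge field is $A^-$ [28] and the transverse parts drop out of the equations, giving a simpler form of 
the equations. 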

These results are to be compared to the factorized results in Eqs. (12) and (13). The hard cross 
section factor in (12) is recovered by noticing that in the kinematic region we are considering 
the Mandelstam variables are given by $\hat s = P_\perp^2/(z(1-z))$, $\hat t = -(P_\perp^2 + \epsilon_f^2)/(1-z)$, and 
$\hat u = -(P_\perp^2 + \epsilon_f^2)/z$. To recover the gluon distribution function as written in Eq. (3) it 
is necessary to account for the different normalizations used to calculate the average of 
Wilson lines above. In Eq. (3) the average is calculated with a hadronic state $|P\rangle$ of definite 
momentum (and therefore translationally invariant), which is relativistically normalized to 
$\langle P'|P\rangle = (2\pi)^3\, 2P^+ \delta(P^+ - P'^+)\, \delta^{(2)}(P_\perp - P'_\perp)$, while the average in Eqs. (30) and (31) is 
taken over the CGC wave function and is normalized such that $\langle 1 \rangle_{x_g} = 1$. Using translational 
invariance Eq. (3) can be written as 



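$$ xG^{(1)}(x,k_\perp) = \frac{4}{\langle P|P\rangle} \int \frac{d^3\xi\, d^3\xi'}{(2\pi)^3}\, e^{i x P^+ (\xi^- - \xi'^-) - i k_\perp \cdot (\xi_\perp - \xi'_\perp)} \left\langle P \right| \mathrm{Tr} \left[ F^{+i}(\xi^-, \xi_\perp)\, \mathcal{U}^{[+]\dagger} F^{+i}(\xi'^-, \xi'_\perp)\, \mathcal{U}^{[+]} \right] \left| P \right\rangle . \tag{32} $$

It is easy to see that the discrepancy between normalizations is accounted for by the 
replacement $\langle P| \ldots |P\rangle / \langle P|P\rangle \to \langle \ldots \rangle_{x_g}$, giving complete agreement between the CGC approach and 
the factorized form in the small-x region. 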

At the end of this subsection, we would like to compare the dijet production process in DIS 
to inclusive and semi-inclusive DIS. As shown above, the dijet production 
cross section in DIS is proportional to the WW gluon distribution in the correlation limit. 
On the other hand, it is well known that inclusive and semi-inclusive DIS involve the dipole 
cross section instead [26], which can be related to the second gluon distribution. This might 
look confusing at first sight, so let us take a closer look at Eq. (22). If one integrates over 
one of the outgoing momenta, say $k_1$, one can easily see that the corresponding coordinates 
in the amplitude and conjugate amplitude are identified ($x_1 = x_1'$) and, therefore, the 
four-point function $S^{(4)}_{x_g}$ collapses to a two-point function $S^{(2)}_{x_g}(x_2, x_2')$. As a result, 
the SIDIS and inclusive DIS cross sections only depend on two-point functions, and thus they 
only involve the dipole gluon distribution. Now we can see the unique feature of the dijet 
production process in DIS. By keeping the momenta of the quark and antiquark unintegrated, 
we keep the full color structure of the four-point function, which eventually leads to the 
WW gluon distribution in the correlation limit. Therefore, measuring the dijet production 
cross sections or dihadron correlations in DIS at future experimental facilities like EIC or 
LHeC would give us a first direct and unique opportunity to probe and understand the 
Weizsäcker-Williams gluon distribution. 

C. Heavy quark production in DIS dijet 



In order to expand our calculation and include the possibility of charm and bottom 
production, we now consider the finite quark mass case. From the TMD point of view, 
having massive quarks modifies the hard cross sections while the parton distributions remain 
the same. The new leading order hard partonic cross sections read 



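$$ H_{\gamma^*_T g \to q\bar q} = \alpha_s \alpha_{em} e_q^2\, z^2 (1-z)^2 \left[ \frac{P_\perp^4 + \epsilon_f'^4}{(P_\perp^2 + \epsilon_f'^2)^4} \left( \frac{\hat u}{\hat t} + \frac{\hat t}{\hat u} \right) + \frac{2 m_q^2 P_\perp^2}{z(1-z) (P_\perp^2 + \epsilon_f'^2)^4} \right] , \tag{33} $$
$$ H_{\gamma^*_L g \to q\bar q} = \alpha_s \alpha_{em} e_q^2\, \frac{8 Q^2}{(\hat s + Q^2)^4} \left( \hat s - \frac{m_q^2}{z(1-z)} \right) , \tag{34} $$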



where $\hat s = (k_1 + k_2)^2 = (P_\perp^2 + m_q^2)/(z(1-z))$, $\hat t = (k_2 - k_{\gamma^*})^2 - m_q^2 = -(P_\perp^2 + \epsilon_f'^2)/(1-z)$, 
and $\hat u = (k_1 - k_{\gamma^*})^2 - m_q^2 = -(P_\perp^2 + \epsilon_f'^2)/z$, with $\epsilon_f'^2 = z(1-z)Q^2 + m_q^2$ and $z = z_q$. 
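
As a quick sanity check of these kinematic relations, the following sketch evaluates the three invariants and verifies the sum rule expected for a virtual photon and a massless gluon producing a massive pair. It is an illustration under stated assumptions: the invariants are taken in the form $\hat s = (P_\perp^2 + m_q^2)/(z(1-z))$, $\hat t = -(P_\perp^2 + \epsilon_f'^2)/(1-z)$, $\hat u = -(P_\perp^2 + \epsilon_f'^2)/z$ with $m_q^2$ already subtracted from $\hat t$ and $\hat u$, and the function name and input values are hypothetical.

```python
# Illustrative check (names and numbers are hypothetical): with t and u
# defined with m_q^2 subtracted, the invariants should satisfy
# s + t + u = -Q^2 for gamma* + g -> q + qbar.

def heavy_quark_mandelstam(p_perp_sq, z, q_sq, m_q_sq):
    """Return (s, t, u) in the correlation limit for massive quarks."""
    eps_sq = z * (1.0 - z) * q_sq + m_q_sq  # epsilon'_f squared
    s = (p_perp_sq + m_q_sq) / (z * (1.0 - z))
    t = -(p_perp_sq + eps_sq) / (1.0 - z)
    u = -(p_perp_sq + eps_sq) / z
    return s, t, u

# Sample values in GeV^2 (hypothetical).
s, t, u = heavy_quark_mandelstam(p_perp_sq=4.0, z=0.3, q_sq=10.0, m_q_sq=1.69)
print(abs(s + t + u + 10.0) < 1e-9)
```

Setting `m_q_sq=0.0` gives the massless invariants of the previous subsection, and `q_sq=0.0` corresponds to the real-photon case.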

In terms of the CGC approach, one needs to modify the dipole splitting wave functions 
as follows: 



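$$ \psi^{T\lambda}_{\alpha\beta}(p^+, z, r) = 2\pi \sqrt{\frac{2}{p^+}} \begin{cases} i \epsilon_f' K_1(\epsilon_f' |r|) \dfrac{r \cdot \epsilon_\perp^{(1)}}{|r|} \left[ \delta_{\alpha+}\delta_{\beta+}(1-z) + \delta_{\alpha-}\delta_{\beta-} z \right] + \delta_{\alpha-}\delta_{\beta+}\, m_q K_0(\epsilon_f' |r|), & \lambda = 1, \\[2mm] i \epsilon_f' K_1(\epsilon_f' |r|) \dfrac{r \cdot \epsilon_\perp^{(2)}}{|r|} \left[ \delta_{\alpha-}\delta_{\beta-}(1-z) + \delta_{\alpha+}\delta_{\beta+} z \right] + \delta_{\alpha+}\delta_{\beta-}\, m_q K_0(\epsilon_f' |r|), & \lambda = 2, \end{cases} \tag{35} $$
$$ \psi^{L}_{\alpha\beta}(p^+, z, r) = 2\pi \sqrt{\frac{4}{p^+}}\, z(1-z)\, Q K_0(\epsilon_f' |r|)\, \delta_{\alpha\beta} . \tag{36} $$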



Following the same procedure, it is easy to show that again both approaches agree in the 
correlation limit for heavy quark production. By setting Q 2 = 0, one can get the results for 
the heavy quark production in real photon-nucleus scattering. 

III. DIRECT-PHOTON JET IN pA COLLISIONS 

Now let us turn our attention to the second gluon distribution. In this context, the sim- 
plest process where we can access this distribution is the direct photon-quark jet correlation 
in pA collisions, 

$$ pA \to \gamma(k_1) + q(k_2) + X , \tag{37} $$

where the incoming quark carries momentum $p$, the target nucleus carries momentum $P_A$, 
and the outgoing photon and quark carry momenta $k_1$ and $k_2$, respectively. The analysis of this 
process follows that for the quark-antiquark jet correlation in the DIS process in the previous 
section. We plot the relevant diagrams in Fig. 4(a,b,c), again for the leading one-gluon 
exchange and two-gluon exchanges. Similarly, the two-gluon exchange contributions can be 
summarized as (see footnote 7) 

Fig. c) ~ {-ig) ( ' , T b T a + -^—T a T b ) , (38) 



7 There is a misprint, which we have corrected below in the eg. (1381) . in the Eq.(ll) of the short summary 



151 ] of this paper.) 

FIG. 4. Same as Fig. 2 for direct photon-jet correlation in pA collisions. 

where the plus sign comes from the fact that the second gluon attaches to the quark line in 
the initial and final states. Since there is no color structure corresponding to Eq. (38), we 
can only express it in the fundamental representation. Following Ref. 0], we find that the 
gluon distribution in this process can be written as 

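$$ xG^{(2)}(x,k_\perp) = 2 \int \frac{d\xi^- d\xi_\perp}{(2\pi)^3 P^+}\, e^{i x P^+ \xi^- - i k_\perp \cdot \xi_\perp} \left\langle P \right| \mathrm{Tr} \left[ F^{+i}(\xi^-, \xi_\perp)\, \mathcal{U}^{[-]\dagger} F^{+i}(0)\, \mathcal{U}^{[+]} \right] \left| P \right\rangle , \tag{39} $$

where the gauge link $\mathcal{U}^{[-]}_\xi = U^n[0, -\infty; 0]\, U^n[-\infty, \xi^-; \xi_\perp]$ resums the initial state 
interactions between the incoming quark and the target nucleus. On the other hand, the gauge 
link $\mathcal{U}^{[+]}$ represents the final state interactions between the outgoing quark and the target 
nucleus. This gluon distribution can also be calculated in the CGC formalism where it is 
found to be 

$$ xG^{(2)}(x,q_\perp) \simeq \frac{q_\perp^2 N_c}{2\pi^2 \alpha_s}\, S_\perp F_{x_g}(q_\perp) , \tag{40} $$

with the normalized unintegrated gluon distribution $F_{x_g}(q_\perp) = \int \frac{d^2 r_\perp}{(2\pi)^2}\, e^{-i q_\perp \cdot r_\perp} S^{(2)}_{x_g}(0, r_\perp)$. 
Therefore, by plugging in the appropriate gluon distribution, namely the dipole gluon 
distribution, which resums both the initial and final state interactions, one can write the 
differential cross section of (37) as (see footnote 8) 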

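$$ \frac{d\sigma^{(pA \to \gamma q + X)}}{d\mathcal{P.S.}} = \sum_f x_p q_f(x_p)\, x_g G^{(2)}(x_g, q_\perp)\, H_{qg \to \gamma q} , \tag{41} $$

where $x_p$ is the momentum fraction of the projectile nucleon carried by the quark and $q_f(x_p)$ 
is the integrated quark distribution. Because we are taking the limit of large nuclear number, the 
intrinsic transverse momentum associated with it can be neglected compared to that from 
the gluon distribution of the nucleus. The hard partonic cross section is given by 

$$ H_{qg \to \gamma q} = \frac{\alpha_s \alpha_{em} e_f^2}{N_c \hat s^2} \left( - \frac{\hat s}{\hat u} - \frac{\hat u}{\hat s} \right) . \tag{42} $$

Inserting Eqs. (40) and (42) in Eq. (41), one gets 

$$ \frac{d\sigma^{(pA \to \gamma q + X)}}{d\mathcal{P.S.}} = \sum_f x_p q_f(x_p)\, \frac{\alpha_{em} e_f^2}{2\pi^2}\, (1-z)\, z^2 \left[ 1 + (1-z)^2 \right] \frac{q_\perp^2 S_\perp F_{x_g}(q_\perp)}{P_\perp^4} , \tag{43} $$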



8 Here we assume that one can employ collinear factorization for the integrated quark or gluon 
density inside the dilute proton at large $x_p$, although the proof of this assumption is omitted throughout 
this paper. We leave this study for future work. 

where we have expressed the Mandelstam variables in terms of $P_\perp$ and $z$: $\hat s = (k_1 + k_2)^2 = \frac{P_\perp^2}{z(1-z)}$, 
$\hat u = (k_1 - p)^2 = -\frac{P_\perp^2}{z}$ and $\hat t = (k_2 - p)^2 = -\frac{P_\perp^2}{1-z}$. The momentum fraction of the 
incoming quark $p$ carried by the outgoing photon, $z$, is defined as 

$$ z = \frac{|k_{1\perp}| e^{y_1}}{|k_{1\perp}| e^{y_1} + |k_{2\perp}| e^{y_2}} , \tag{44} $$

where $y_1$ and $y_2$ are the rapidities of the photon and outgoing quark in the Lab frame. 
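
The step from the Mandelstam variables to the explicit $z$ dependence can be illustrated with a short numerical check. This is a sketch under stated assumptions: the invariants are taken as $\hat s = P_\perp^2/(z(1-z))$, $\hat u = -P_\perp^2/z$, $\hat t = -P_\perp^2/(1-z)$, and the hard factor is assumed to carry the usual Compton combination $-\hat s/\hat u - \hat u/\hat s$; function names and sample values are hypothetical.

```python
# Illustrative check (hypothetical names and numbers): for massless
# q g -> gamma q kinematics, s + t + u = 0, and the Compton combination
# -s/u - u/s reduces to (1 + (1 - z)^2) / (1 - z).

def photon_jet_mandelstam(p_perp_sq, z):
    """Return (s, t, u) for q g -> gamma q in the correlation limit."""
    s = p_perp_sq / (z * (1.0 - z))
    u = -p_perp_sq / z
    t = -p_perp_sq / (1.0 - z)
    return s, t, u

def compton_combination(s, u):
    """The combination -s/u - u/s assumed to appear in the hard factor."""
    return -s / u - u / s

z = 0.4
s, t, u = photon_jet_mandelstam(p_perp_sq=9.0, z=z)
bracket = compton_combination(s, u)
expected = (1.0 + (1.0 - z) ** 2) / (1.0 - z)
print(abs(s + t + u) < 1e-9, abs(bracket - expected) < 1e-12)
```

Combined with $1/\hat s^2 = z^2(1-z)^2/P_\perp^4$, this reproduces the factor $z^2 (1-z) [1 + (1-z)^2]/P_\perp^4$ in the cross section.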

The currently running RHIC and LHC experiments should provide some information 
on the dipole gluon distribution by measuring the direct photon-quark jet correlation in pA 
collisions. 



A. CGC approach to the direct photon-jet production in pA collisions 



This process was already considered in the CGC framework in [33], where the calculation 
was performed entirely in momentum space. In order to compare with the result from the 
previous section and illustrate why a different distribution should be used, we will derive the 
corresponding cross section following the same procedure as the previous section by showing 
the splitting wave function and the multiple scattering factor in transverse coordinate space. 
Our result is consistent with [33]. 

Let us consider the partonic level process $q \to q\gamma$. For a right-moving massless quark, 
with initial longitudinal momentum $p^+$ and no transverse momentum, the splitting wave 
function in transverse coordinate space is given by 

$$ \psi^{\lambda}_{\alpha\beta}(p^+, k_1^+, r) = 2\pi i \sqrt{\frac{2}{k_1^+}} \begin{cases} \dfrac{r \cdot \epsilon_\perp^{(1)}}{r^2} \left( \delta_{\alpha-}\delta_{\beta-} + (1-z)\, \delta_{\alpha+}\delta_{\beta+} \right), & \lambda = 1, \\[2mm] \dfrac{r \cdot \epsilon_\perp^{(2)}}{r^2} \left( \delta_{\alpha+}\delta_{\beta+} + (1-z)\, \delta_{\alpha-}\delta_{\beta-} \right), & \lambda = 2, \end{cases} \tag{45} $$

where again $\lambda$ is the photon polarization, $\alpha$, $\beta$ are helicities for the incoming and outgoing 
quarks, and $z$ is the momentum fraction of the incoming quark carried by the photon. To 
account for the multiple scatterings in this process we have to consider interactions both 
before and after the splitting. If the transverse coordinates of the quark and photon in the 
final state are $b$ and $x$ respectively, then the multiple scattering factor in the amplitude takes 
the form $U(b) - U(zx + (1-z)b)$. 

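After summing over final polarization, helicity and color, and averaging over initial helicity 
and color, we find the following expression for the partonic level cross section (see Fig. 5), 

$$ \frac{d\sigma^{qA \to q\gamma X}}{d^3k_1\, d^3k_2} = \alpha_{em} e_f^2\, \delta(p^+ - k_1^+ - k_2^+) \int \frac{d^2x}{(2\pi)^2}\frac{d^2x'}{(2\pi)^2}\frac{d^2b}{(2\pi)^2}\frac{d^2b'}{(2\pi)^2}\, e^{-ik_{1\perp}\cdot(x-x')}\, e^{-ik_{2\perp}\cdot(b-b')} \sum_{\lambda\alpha\beta} \psi^{\lambda *}_{\alpha\beta}(x'-b')\, \psi^{\lambda}_{\alpha\beta}(x-b) \left[ S^{(2)}_{x_g}(b, b') + S^{(2)}_{x_g}(zx + (1-z)b,\, zx' + (1-z)b') - S^{(2)}_{x_g}(b,\, zx' + (1-z)b') - S^{(2)}_{x_g}(zx + (1-z)b,\, b') \right] . \tag{46} $$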



FIG. 5. Interactions before and after the splitting have to be taken into account for both amplitude 
and conjugate amplitude. Here is a typical diagram representing the third interaction term in Eq. (46). 

Notice that the color structure is simpler than in the DIS case. There is no four-point 
function and all the terms in the multiple scattering factor can be expressed in terms of 
the color dipole cross section $S^{(2)}_{x_g}$. By changing the variables on each of the terms of the 
scattering factor to $u = x - b$ and either $v = b$ or $v = zx + (1-z)b$, and similarly for the 
primed coordinates, the cross section above can be written as 



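$$ \frac{d\sigma^{qA \to q\gamma X}}{d^3k_1\, d^3k_2} = \alpha_{em} e_f^2\, \delta(p^+ - k_1^+ - k_2^+) \int \frac{d^2u}{(2\pi)^2}\frac{d^2u'}{(2\pi)^2}\frac{d^2v}{(2\pi)^2}\frac{d^2v'}{(2\pi)^2}\, e^{-i q_\perp \cdot (v - v')} \sum_{\lambda\alpha\beta} \psi^{\lambda *}_{\alpha\beta}(u')\, \psi^{\lambda}_{\alpha\beta}(u)\, S^{(2)}_{x_g}(v, v') \left[ e^{-iu\cdot(\tilde P_\perp + z q_\perp)}\, e^{iu'\cdot(\tilde P_\perp + z q_\perp)} + e^{-iu\cdot\tilde P_\perp}\, e^{iu'\cdot\tilde P_\perp} - e^{-iu\cdot(\tilde P_\perp + z q_\perp)}\, e^{iu'\cdot\tilde P_\perp} - e^{-iu\cdot\tilde P_\perp}\, e^{iu'\cdot(\tilde P_\perp + z q_\perp)} \right] , \tag{47} $$

where $\tilde P_\perp = (1-z) k_{1\perp} - z k_{2\perp} \simeq P_\perp$. 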
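
The two momentum arguments that appear in the phase factors follow from the two choices of $v$. The sketch below checks this numerically; it is illustrative only, assumes $q_\perp = k_{1\perp} + k_{2\perp}$, and uses hypothetical function names and sample values.

```python
# Illustrative check (hypothetical names and numbers): the phase
# k1.x + k2.b with u = x - b can be rewritten in two ways,
#   v = b               ->  u.(P_tilde + z*q) + q.v
#   v = z*x + (1 - z)*b ->  u.P_tilde + q.v
# where P_tilde = (1 - z)*k1 - z*k2 and q = k1 + k2 (assumed notation).

def dot(a, b):
    """Dot product of two 2D transverse vectors given as tuples."""
    return a[0] * b[0] + a[1] * b[1]

def photon_jet_phases(k1, k2, x, b, z):
    """Return the original phase and its two rewritten forms."""
    q = (k1[0] + k2[0], k1[1] + k2[1])
    p_tilde = ((1 - z) * k1[0] - z * k2[0], (1 - z) * k1[1] - z * k2[1])
    u = (x[0] - b[0], x[1] - b[1])
    full = dot(k1, x) + dot(k2, b)
    # Choice v = b (scattering after the splitting).
    shifted = (p_tilde[0] + z * q[0], p_tilde[1] + z * q[1])
    phase_v_is_b = dot(shifted, u) + dot(q, b)
    # Choice v = z*x + (1 - z)*b (scattering before the splitting).
    v = (z * x[0] + (1 - z) * b[0], z * x[1] + (1 - z) * b[1])
    phase_v_is_center = dot(p_tilde, u) + dot(q, v)
    return full, phase_v_is_b, phase_v_is_center

full, phase_b, phase_center = photon_jet_phases((0.9, 0.2), (-0.6, 0.4), (0.3, -0.2), (0.1, 0.5), 0.25)
print(abs(full - phase_b) < 1e-12, abs(full - phase_center) < 1e-12)
```

In both cases the $v$ dependence is carried by $q_\perp$ alone, which is why the Fourier transform of the dipole amplitude factors out of the $u$ integrals.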

From the above expression it is easy to see that performing the $u$ and $u'$ integrations 
reduces to taking the Fourier transform of the splitting wave function with different values 
of the momentum variable for each term. Clearly, the Fourier transform of the dipole cross 
section factors out, giving the gluon distribution we found from the TMD-factorized form. 
Using the collinear approximation for the proton projectile we find our final result for the cross 
section of the desired process, 



da pA^ iq +x 
dV.S. 



z)—. 



Y] x p q f (x p )a em e 2 f N c [l + (1 - z) 2 ] z 2 (l . . 



2q\ 



x 



(2tt) 2 (2tt) 



(48) 



This result agrees with the factorized result (41) in the correlation limit $P_\perp \gg q_\perp$. To make 
the relation between the distribution $xG^{(2)}$ in Eq. (6) and the result above clearer, notice 
that the factor $q_\perp^2$ can be brought inside the integral as derivatives of the exponential factor 
with respect to $v$ and $v'$. Using integration by parts and the derivation formula for Wilson 
lines it is easy to show that the cross section takes the form 

da dV s. = EwfeM I 1 + C 1 - z ) 2 ] z2 i l - *) P 2 {P± 2 + zq±)2 

x 167r 3 a s / ,f^,f^ e " gX,(W) < Tr [i ?i "(^)WHt j p i -(^)WW] ) . (49) 

7 (27T J (27Tj d 

Taking into account the same considerations about different normalizations of the averaging 
procedures as in the DIS case, it is easy to see that the two expressions for $xG^{(2)}$ agree in 
the small-x region. 



IV. DIJET PRODUCTION IN pA COLLISIONS 

Dijet production in pA collisions receives contributions from several channels such as 
$qg \to qg$, $gg \to q\bar q$ and $gg \to gg$. For convenience, we define the following common variables 
as in the last two sections, 

$$ z = \frac{|k_{1\perp}| e^{y_1}}{|k_{1\perp}| e^{y_1} + |k_{2\perp}| e^{y_2}} , \qquad x_p = \frac{|k_{1\perp}| e^{y_1} + |k_{2\perp}| e^{y_2}}{\sqrt{s}} , \qquad x_g = \frac{|k_{1\perp}| e^{-y_1} + |k_{2\perp}| e^{-y_2}}{\sqrt{s}} , \tag{50} $$

where $k_1$ and $k_2$ are the momenta and $y_1$ and $y_2$ are the rapidities of the two outgoing particles, 
$x_p$ is the momentum fraction of the projectile nucleon carried by the incoming parton, and $x_g$ 
is the momentum fraction of the target nucleus carried by the gluon. Taking 
into account that the quark distribution functions are dominant at large x and the gluon 
distribution functions are dominant at low x, it comes as no surprise that different 
channels are relevant in different kinematic regions. At RHIC energies, the low-x region is 
only accessible in events where the two jets are produced in the forward rapidity region of 
the projectile. Under those conditions we have $x_p \sim 0.1$ and $x_g \ll 0.1$, and therefore quark 
initiated processes dominate ($qg \to qg$ channel). 
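
To get a feeling for these variables, the sketch below evaluates them for a forward dijet configuration. It is only an illustration: it assumes the definitions $x_p = (|k_{1\perp}| e^{y_1} + |k_{2\perp}| e^{y_2})/\sqrt{s}$ and $x_g = (|k_{1\perp}| e^{-y_1} + |k_{2\perp}| e^{-y_2})/\sqrt{s}$, and the function name, the RHIC-like value $\sqrt{s} = 200$ GeV and the jet momenta and rapidities are hypothetical inputs.

```python
import math

# Illustrative evaluation (hypothetical inputs) of the dijet variables
# z, x_p and x_g from transverse momenta and rapidities.

def dijet_kinematics(k1, y1, k2, y2, sqrt_s):
    """Return (z, x_p, x_g).

    k1, k2: transverse momentum magnitudes in GeV,
    y1, y2: rapidities, sqrt_s: collision energy in GeV.
    """
    plus = k1 * math.exp(y1) + k2 * math.exp(y2)
    minus = k1 * math.exp(-y1) + k2 * math.exp(-y2)
    z = k1 * math.exp(y1) / plus
    return z, plus / sqrt_s, minus / sqrt_s

# Two 2 GeV jets, both at rapidity 3, at sqrt(s) = 200 GeV.
z, x_p, x_g = dijet_kinematics(2.0, 3.0, 2.0, 3.0, 200.0)
print(z, x_p, x_g)
```

For this forward configuration the projectile is probed at a moderately large momentum fraction while the target is probed at a momentum fraction of order $10^{-3}$, in line with the statement that quark initiated processes dominate there.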

The higher energies available at the LHC will allow a more thorough exploration of the low-x 
regime in the target nucleus as well as in the projectile (see, e.g., a recent study [34]). Under 
these circumstances, and in particular at central rapidities at the LHC, it is possible to have 
processes with both $x_p$ and $x_g$ small, where the dominant channels are $gg \to q\bar q$ and $gg \to gg$. 

Let us first take the partonic channel $qg \to qg$ as an example and calculate the dijet 
production cross section. Then it is straightforward to generalize the calculation to the 
other partonic channels $gg \to q\bar q$ and $gg \to gg$. 



A. TMD-factorization approach 

1. The $qg \to qg$ channel 

The calculations follow the previous examples. However, there are several different Feynman 
graphs contributing to the production of $qg$ in the final state, as shown in Fig. 6. In 
addition, they have different color structures. Therefore, we need to compute the hard 
factors and the associated initial/final state interaction phases separately. In the end, we will 
sum their contributions together to obtain the final result. 

FIG. 6. Quark-gluon scattering diagrams. The mirror diagrams of (3), (5) and (6) give identical 
contributions. 



TABLE II. The color and hard factors for the $qg \to qg$ scattering channels in Fig. 6, where 
$C_F = (N_c^2 - 1)/2N_c$. 





(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


h 


A(P-su) 2 
Psn 


2{u 2 +P) 
su 


2(P-su)(u-t) 
stu 


2{s 2 +P) 
su 


2{P-su)(s-t) 
stu 


2P 
su 




1 

2 


C F 
2N C 


1 

4 


Cf 
2N C 


1 

4 


1 



It is straightforward to obtain the hard cross section contributions from each diagram in 
Fig. 6 for the $qg \to qg$ process; they have been calculated in Ref. [3]. We list these results 
in Table II with the same notations, where $h^{(i)}$ is the partonic hard factor and $C_u^{(i)}$ is the 
associated color factor. In the calculations, in order to apply the eikonal approximation 
when multiple gluon interactions are formulated, we have chosen the physical polarizations 
for the outgoing gluon. However, the final result for the differential cross section does not 
depend on this choice. 

As a consistency check, we can easily reproduce the known results for the total hard cross 
section by summing all the graphs in Fig. 6 and explicitly taking $N_c = 3$, 

$$ \frac{d\hat\sigma}{d\hat t}(qg \to qg) = \frac{g^4}{16\pi \hat s^2} \left[ \sum_{i=1,2,4} C_u^{(i)} h^{(i)} + 2 \sum_{i=3,5,6} C_u^{(i)} h^{(i)} \right] = \frac{g^4}{16\pi \hat s^2} \left( \frac{4}{9} \frac{\hat s^2 + \hat u^2}{-\hat s \hat u} + \frac{\hat s^2 + \hat u^2}{\hat t^2} \right) . \tag{51} $$

Since the graphs in Fig. 6 have different color structures, the gluon distributions associated 
with those graphs have different gauge links according to Ref. [2]. Therefore, the 
corresponding gluon distributions in coordinate space are found as follows: 



9 



$(2) 

a 



$(3) 
a 



Tr 



Tr 



Tr 



(52) 



F( ^^ [+lt+ ^ h T (0)MH 



Tr rzypn 

F (0 ^ J ^ [+]t F(0)^+] 



d>(4),(5),(6) = (Tr [F(^HtF(0)#l]), 



(54) 
(55) 



where $\mathcal{U}^{[\Box]} = \mathcal{U}^{[+]} \mathcal{U}^{[-]\dagger} = \mathcal{U}^{[-]\dagger} \mathcal{U}^{[+]}$ emerges as a Wilson loop. Now we are ready to combine 
all the channels together. As mentioned in the introduction, the distributions above will 
be factorizable in terms of convolutions of the two basic distributions from the previous 
sections. Anticipating this result, we consider only the leading contribution in $N_c$. Noting 
that graph (6) in Fig. 6 does not contribute in the large-$N_c$ limit, one can find 



J qA^qgX 
"°TMD 

d 2 P±d 2 q±dyidy2 



f 



■p) P 



hr(i)#(i) + jK2)#(2) ] 
L ia qa^qa 1 ia qg-^qgl 



(56) 



with 



x-(i) 
J qg 



7T(2) 

J qa 



d£ d^± 
(2vr) 3 P^ 



(Tr [F (OW Ht F(0)W [+] ]) 



JxP+S, -iq±-£± 



Tr 



Nr. 



(57) 
(58) 



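In the large-$N_c$ limit, it is straightforward to find that only graphs (1), (2) and (3) in Fig. 6 
($t$ and $u$ channels together with their cross diagrams) contribute to $H^{(2)}_{qg \to qg}$ and only graphs 
(1), (4) and (5) ($t$ and $s$ channels together with their cross diagrams) contribute to $H^{(1)}_{qg \to qg}$. 
By using $C_F = N_c/2$ in the large-$N_c$ limit, one obtains 

$$ H^{(1)}_{qg \to qg} = - \frac{(\hat t^2 - \hat s \hat u)^2}{\hat s \hat u \hat t^2} - \frac{1}{2} \frac{\hat t^2 + \hat s^2}{\hat s \hat u} - \frac{(\hat t^2 - \hat s \hat u)(\hat s - \hat t)}{\hat s \hat u \hat t} = - \frac{\hat u^2 (\hat s^2 + \hat u^2)}{2 \hat s \hat u \hat t^2} , \tag{59} $$
$$ H^{(2)}_{qg \to qg} = - \frac{(\hat t^2 - \hat s \hat u)^2}{\hat s \hat u \hat t^2} - \frac{1}{2} \frac{\hat t^2 + \hat u^2}{\hat s \hat u} - \frac{(\hat t^2 - \hat s \hat u)(\hat u - \hat t)}{\hat s \hat u \hat t} = - \frac{\hat s^2 (\hat s^2 + \hat u^2)}{2 \hat s \hat u \hat t^2} . \tag{60} $$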



We note that although the individual diagram's contribution to the above two hard factors 
depends on the polarization we choose for the outgoing gluon, the final results for the hard 
factors do not depend on this choice. This means the combination of Feynman graphs 
according to the relevant color structure is gauge invariant. Similar conclusion has also been 
obtained for the spin related observables calculated in Refs. 0, [sj] . 
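
A simple numerical cross-check of the large-$N_c$ hard factors is sketched below. It is illustrative and rests on stated assumptions: the two hard factors are taken in the final forms $H^{(1)} = -\hat u^2(\hat s^2 + \hat u^2)/(2\hat s\hat u\hat t^2)$ and $H^{(2)} = -\hat s^2(\hat s^2 + \hat u^2)/(2\hat s\hat u\hat t^2)$, and they are compared with the standard $qg \to qg$ combination $(\hat s^2 + \hat u^2)/\hat t^2 - (C_F/N_c)(\hat s^2 + \hat u^2)/(\hat s\hat u)$ with $C_F/N_c \to 1/2$. Function names and the kinematic point are hypothetical.

```python
# Illustrative check (hypothetical names and numbers): the two large-N_c
# hard factors should add up to the large-N_c limit of the full
# q g -> q g combination when s + t + u = 0.

def full_qg_combination(s, t, u, cf_over_nc=4.0 / 9.0):
    """Standard q g -> q g bracket; cf_over_nc = 4/9 corresponds to N_c = 3."""
    return (s * s + u * u) / (t * t) - cf_over_nc * (s * s + u * u) / (s * u)

def hard_factor_1(s, t, u):
    """Assumed final form of H^(1) for q g -> q g."""
    return -u * u * (s * s + u * u) / (2.0 * s * u * t * t)

def hard_factor_2(s, t, u):
    """Assumed final form of H^(2) for q g -> q g."""
    return -s * s * (s * s + u * u) / (2.0 * s * u * t * t)

# Massless 2 -> 2 kinematics: s > 0, t < 0, u < 0, s + t + u = 0.
s, t, u = 1.0, -0.3, -0.7
total = hard_factor_1(s, t, u) + hard_factor_2(s, t, u)
large_nc = full_qg_combination(s, t, u, cf_over_nc=0.5)
print(abs(total - large_nc) < 1e-12)
```

Both hard factors are positive for physical kinematics, and their ratio is $\hat u^2/\hat s^2$.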



Since one has $\hat s = \frac{P_\perp^2}{z(1-z)}$, $\hat u = -\frac{P_\perp^2}{z}$ and $\hat t = -\frac{P_\perp^2}{1-z}$ in the correlation limit, Eq. (56) leads 

FIG. 7. $gg \to q\bar q$ scattering diagrams. The mirror diagrams of (3), (5) and (6) give identical 
contributions. 



TABLE III. The color and hard factors for the $gg \to q\bar q$ scattering channels in Fig. 7. 





(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


h 


2(3t 2 +u 2 )u 

(i+u) 2 i 


2(t 2 +3u 2 )i 
(i+u) 2 u 


2(i-u)' 2 
(t+u) 2 


(i+u) 2 


(i+u) 2 


Aiu 

(i+u) 2 




i 


1 

4N C 


1 


N c 


N c 


N c 


W c (N 2 -l) 


2(N 2 -1) 


4(N 2 -1) 


4(N 2 -1) 



to the following cross section for qg dijet production in pA collisions 

j P A-+qgX 
TMD 



d 2 Pj_d 2 q J _dy 1 dy 2 




where $x_p q_f(x_p)$ is the integrated quark distribution for the proton projectile. 



2. The $gg \to q\bar q$ channel 

Following the same procedure illustrated in the $qg \to qg$ channel, we can calculate the 
dijet production cross section from the $gg \to q\bar q$ channel. First of all, we compute the color 
factors and hard factors for each graph in Fig. 7 and list them in Table III. Then, we plug 
in the appropriate gluon distributions (see footnote 9) as found in Ref. [2], 



$(1).(2) 

a 



$(3) 

a 

*(4),(5),(B) 

a 



Tr 



F(0)U [ 



-N c <Tr [F (0 U [D] ] Tr [F (0) W [n]t ] ) 
Tr[F(0^ [ - ]t F(0)^W] L " J 



-^<Tr[F(OWP]]Tr[F(0)WW]> 



(62) 
(63) 



(64) 



Combining all the channels in the large N c limit, we can find 



J-aA^qqX 2 
TMD _ / 

<PP^q x dy x dy 2 Z-,W V ), 2 



J aa aa^qq ~ J aa aa^qq 



7(2) lt(2) 



(65) 



with 



r(l) = 9 / ^ ^-L 

99 ' (27^) 3 P^ 



Tr 



5^3^Ht F ( )Z/[ 



77(2) = 9 / g jEi ^p+r-^i-e 

39 / (2vr) 3 P+ 



;Tr[F(0WW]Tr [F(0)WP]> 



(66) 
(67) 



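and 

$$ H^{(1)}_{gg \to q\bar q} = \frac{1}{4N_c} \frac{2 (\hat t^2 + \hat u^2)^2}{\hat s^2 \hat u \hat t} , \tag{68} $$
$$ H^{(2)}_{gg \to q\bar q} = \frac{1}{4N_c} \frac{4 (\hat t^2 + \hat u^2)}{\hat s^2} , \tag{69} $$

where $x_p g(x_p)$ is the integrated gluon distribution in the proton projectile. 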



3. The $gg \to gg$ channel 

Similarly, the color factors and hard factors for all the graphs plotted in Fig. 8 have been 
calculated and listed in Table IV. Combining these factors with the corresponding gluon 



9 We have simplified these gluon distributions by using the large-$N_c$ limit and the fact that they are real in the 
CGC formalism. 

FIG. 8. $gg \to gg$ scattering diagrams. The mirror diagrams of (3), (5) and (6) give identical 
contributions. 



TABLE IV. The color and hard factors for the $gg \to gg$ scattering channels in Fig. 8. 



2(s 4 +4s-H+lls J t 2 +l0st- i +4:i 4 ) 
(s+t) 2 s 2 



C u 



(1) 



2(2s b +&s ! H+l4s 4 P+20s*P+21s 2 i 4 +l4st b +\Wj 
(s+t) 2 s 2 t 2 



(2) 



(2s 4 +5s'- i t+10s 2 t 2 +10sF +4t 4 )(s+2t) 
(s+t) 2 s 2 t 



(3) 



2(s 4 +S 3 i+5s 2 t 2 +6st- i +2t 4 ) 
(s+t) 2 S 2 



N 2 
2(iv/-l) 



NT 
N 2 -l 



(4) 



(5) 



2s 5 +s 4 i-$ 3 t 2 - 10s 2 P - 12§i 4 -4i 5 



(s d + I0s 2 t+12st 2 +4PJT 
(s+t) 2 S 2 



N 2 
71 — 



(6) 



N 2 
distributions, taking into account the appropriate gauge links [2], we arrive at 



+ /lV[F(e)wHt F (o) W W] 

$(3) = ^Tr [F(OWWt F (0)WW] - |P - ^ 
+ ^<Tr[F(OWP]]Tr[F(0)WW]), 
$(4),(5),(6) = / Tr [ F(e)W Ht F(0)w [+]] Tr ^] 

-^<Tr[F(OWP]]Tr[F(0)W»]). 
Summing over all the channels in the large- N c limit, we can obtain 



i gA^ggX 
TMD 

d 2 P_ L d 2 q_ L dy 1 dy 2 



(70) 



(71) 



(72) 



/_ u - L py\ d 'P) S 2 1/ ss 1I gg^gg ~ ^ gg 11 gg^gg T -^99 11 gg^gg\ > v' d / 



where $\mathcal{F}^{(1,2)}_{gg}$ have been defined in Eqs. (66) and (67), and $\mathcal{F}^{(3)}_{gg}$ is defined as 



(3) 



J 7 ^ = 2 
^99 z 



W(flW)*"] T ] T ] ) 



(27r) 3 P+ 
The hard factors are found as 



gg^gg 



2 (s 2 - fo) 

99^99 - £ 2 ;2 



U 2 t 2 S 2 
2 



gg^gg 



uts z 



.(74) 



(75) 



Using the mean field approximation [7], we can simplify the gluon distributions and find 
the total dijet production cross section, which includes the $qg \to qg$, $gg \to q\bar q$ and $gg \to gg$ 
channels, as follows 

$$ \frac{d\sigma^{(pA \to \mathrm{Dijet} + X)}}{d\mathcal{P.S.}} = \sum_q x_1 q(x_1) \frac{\alpha_s^2}{\hat s^2} \left[ \mathcal{F}^{(1)}_{qg} H^{(1)}_{qg \to qg} + \mathcal{F}^{(2)}_{qg} H^{(2)}_{qg \to qg} \right] + x_1 g(x_1) \frac{\alpha_s^2}{\hat s^2} \left[ \mathcal{F}^{(1)}_{gg} \left( H^{(1)}_{gg \to q\bar q} + \frac{1}{2} H^{(1)}_{gg \to gg} \right) + \mathcal{F}^{(2)}_{gg} \left( H^{(2)}_{gg \to q\bar q} + \frac{1}{2} H^{(2)}_{gg \to gg} \right) + \frac{1}{2} \mathcal{F}^{(3)}_{gg} H^{(3)}_{gg \to gg} \right] , \tag{76} $$

FIG. 9. Interactions before and after the splitting have to be taken into account for both amplitude 
and conjugate amplitude. After the splitting the nucleus interacts coherently with the quark-gluon 
system. Here is a typical diagram representing the second interaction term in Eq. (78). 

where again $q(x_1)$ and $g(x_1)$ are integrated quark and gluon distributions from the projectile 
nucleon. We have included a statistical factor of $\frac{1}{2}$ in Eq. (76) for the $gg \to gg$ channel due 
to the identical particles in the final state. The various gluon distributions of nucleus A are defined as 

$$ \mathcal{F}^{(1)}_{qg} = xG^{(2)}(x, q_\perp) , \qquad \mathcal{F}^{(2)}_{qg} = \int xG^{(1)}(q_1) \otimes F(q_2) , $$
$$ \mathcal{F}^{(1)}_{gg} = \int xG^{(2)}(q_1) \otimes F(q_2) , \qquad \mathcal{F}^{(2)}_{gg} = - \int \frac{q_{1\perp} \cdot q_{2\perp}}{q_{1\perp}^2}\, xG^{(2)}(q_1) \otimes F(q_2) , $$
$$ \mathcal{F}^{(3)}_{gg} = \int xG^{(1)}(q_1) \otimes F(q_2) \otimes F(q_3) , \tag{77} $$

where $\otimes$ represents the convolution in momentum space: $\int \otimes = \int d^2q_1\, d^2q_2\, \delta^{(2)}(q_\perp - q_1 - q_2)$. 
These expressions follow directly from Eqs. (57), (58), (66), (67), (74) and the assumption 
that in the large-$N_c$ limit the expectation values involved in these equations can be factored 
as products of expectation values of the traces within. Clearly, this process depends on both 
UGDs in a complicated way, and the naive TMD-factorization does not hold. 



B. CGC Calculations 



1. $q \to qg$ 

This process is studied in detail in Refs. [35, 36], and in particular Ref. [36] is close to 
the approach we have followed in this paper, where an explicit formula analogous to the 
ones cited here for DIS and photon emission is given. We take as starting point Eq. (24) in 
Ref. [36], which in our notation takes the form (see Fig. 9) 



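$$ \frac{d\sigma^{qA \to qgX}}{d^3k_1\, d^3k_2} = \alpha_S C_F\, \delta(p^+ - k_1^+ - k_2^+) \int \frac{d^2x}{(2\pi)^2}\frac{d^2x'}{(2\pi)^2}\frac{d^2b}{(2\pi)^2}\frac{d^2b'}{(2\pi)^2}\, e^{-ik_{1\perp}\cdot(x-x')}\, e^{-ik_{2\perp}\cdot(b-b')} \sum_{\lambda\alpha\beta} \psi^{\lambda *}_{\alpha\beta}(x'-b')\, \psi^{\lambda}_{\alpha\beta}(x-b) \left[ S^{(6)}_{x_g}(b, x, b', x') - S^{(3)}_{x_g}(b, x, zx' + (1-z)b') - S^{(3)}_{x_g}(zx + (1-z)b, x', b') + S^{(2)}_{x_g}(zx + (1-z)b,\, zx' + (1-z)b') \right] , \tag{78} $$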



where 

$$ S^{(6)}_{x_g}(b, x, b', x') = \frac{1}{C_F N_c} \left\langle \mathrm{Tr} \left( U(b) U^\dagger(b') T^d T^c \right) \left[ W(x) W^\dagger(x') \right]^{cd} \right\rangle_{x_g} , \tag{79} $$
$$ S^{(3)}_{x_g}(b, x, v') = \frac{1}{C_F N_c} \left\langle \mathrm{Tr} \left( U(b) T^d U^\dagger(v') T^c \right) W^{cd}(x) \right\rangle_{x_g} , \tag{80} $$

and $W(x)$ is a Wilson line in the adjoint representation. In the correlators above, Wilson 
lines in the fundamental representation appear when considering the multiple interaction 
of a quark with the nucleus and Wilson lines in the adjoint representation appear when 
considering multiple interactions of a gluon with the nucleus. Clearly, the $S^{(6)}_{x_g}$ term 
represents the case where interactions occur after the splitting both in the amplitude and in 
the conjugate amplitude, the $S^{(3)}_{x_g}$ terms represent the interference terms, and the $S^{(2)}_{x_g}$ term 
represents interactions before the splitting only. 

This formula for the cross section has the same structure as Eqs. (22) and (47). The 
splitting wave function is the same as in the photon emission case (Eq. (45)). The only 
difference in the emission vertex is a color matrix which is included as part of the multiple 
scattering factor (therefore confining the color algebra to just the multiple scattering factor). 
Using Fierz identities, the terms appearing in the multiple scattering factor above can be 
written in terms of fundamental Wilson lines only as 

$$ S^{(6)}_{x_g}(b, x, b', x') = \frac{1}{2 C_F N_c} \left\langle \mathrm{Tr} \left( U(b) U^\dagger(b') U(x') U^\dagger(x) \right) \mathrm{Tr}\, U(x) U^\dagger(x') - \frac{1}{N_c} \mathrm{Tr}\, U(b) U^\dagger(b') \right\rangle_{x_g} , \tag{81} $$
$$ S^{(3)}_{x_g}(b, x, v') = \frac{1}{2 C_F N_c} \left\langle \mathrm{Tr}\, U(b) U^\dagger(x)\, \mathrm{Tr}\, U(x) U^\dagger(v') - \frac{1}{N_c} \mathrm{Tr}\, U(b) U^\dagger(v') \right\rangle_{x_g} . \tag{82} $$



Some of the correlators appearing in the expressions above are familiar or have been 
calculated in the literature before. The 4-point function in $S^{(3)}_{x_g}$ is different from the one 
appearing in the DIS case but it has been studied and calculated in a model with a Gaussian 
distribution of sources in [31]. The 6-point function appearing in $S^{(6)}_{x_g}$ presents a more 
difficult challenge even with only four independent coordinates. In order to deal with this 
difficulty, it is convenient to address the problem in the large-$N_c$ limit, where correlators 
of products of traces are evaluated as products of correlators of traces. Specifically, for the 
correlators above we get 

$$ S^{(6)}_{x_g}(b, x, b', x') \simeq \frac{1}{N_c} \left\langle \mathrm{Tr}\, U(b) U^\dagger(b') U(x') U^\dagger(x) \right\rangle_{x_g} \frac{1}{N_c} \left\langle \mathrm{Tr}\, U(x) U^\dagger(x') \right\rangle_{x_g} \tag{83} $$
$$ = S^{(4)}_{x_g}(b, x; x', b')\, S^{(2)}_{x_g}(x, x') , \tag{84} $$
$$ S^{(3)}_{x_g}(b, x, v') \simeq \frac{1}{N_c} \left\langle \mathrm{Tr}\, U(b) U^\dagger(x) \right\rangle_{x_g} \frac{1}{N_c} \left\langle \mathrm{Tr}\, U(x) U^\dagger(v') \right\rangle_{x_g} \tag{85} $$
$$ = S^{(2)}_{x_g}(b, x)\, S^{(2)}_{x_g}(x, v') . \tag{86} $$

FIG. 10. Graphical representation of the splitting $q \to qg$ in the large-$N_c$ limit, in the amplitude 
and the conjugate amplitude. 

Note that in the large-$N_c$ limit, the 6-point function is related to the 4-point function that 
appeared in the DIS dijet case. In Appendix B 2 we point out that the result of [36] for the 
6-point function misses the inelastic part of $S^{(4)}_{x_g}$. 

To enforce the correlation limit we follow the procedure used in the DIS case. From the 
structure of the terms in the multiple scattering factor, we can see that the same kind of 
cancellations will occur and the final result will be the sum of the lowest order non-vanishing 
terms from the expansion of $S^{(6)}_{x_g}$. Moreover, since there is no linear term in the expansion 
of $S^{(2)}_{x_g}$, the lowest non-vanishing terms come separately from the $S^{(4)}_{x_g}$ factor and the $S^{(2)}_{x_g}$ factor 
in the same fashion as in the previous calculations for DIS and photon emission. With the 
previous considerations in mind, it is easy to see that the final result takes the form 



diq ± dip ± d yi dy 2 ^ ™ v P 



d 3 v d 3 v' 



e -ig_i_ •(«-«') 



z? 



(Tr [F i -(w)W [ - ]t F i -({/)W [+1 ]) 3 



(2tt) 3 (2tt) 3 

+4- (TtU(v)UUv')) , (Tr \F i -(m^F i -(v')U^\) 



(87) 



Taking into account the difference between the normalizations, it is straightforward to 
see that the result above agrees with the factorized formula (jBTj) . 

In order to bring some insight to the relation between the processes considered so far, and
how the different distributions come in for this particular channel, it is useful to consider
the graphical representation of the large-$N_c$ limit used to factorize the correlators of Wilson
lines. In the large-$N_c$ limit, a gluon line can be effectively considered as a quark-antiquark
pair. Forgetting about the multiple interactions for the moment, and focusing primarily on
the color flow of the process, we see that in the large-$N_c$ limit the process takes the form
depicted in Fig. 10. The system splits into two separate pieces, a quark line in the lower
part of the diagram which resembles the photon emission process, and a loop in the upper
part of the diagram which has the same color structure as the DIS dijet case. Interactions
involving both parts of the process are $N_c$-suppressed, so it comes as no surprise that the
final result can be written as two separate pieces, each involving the respective distributions
found for DIS and photon emission.

The fact that one of the terms in the final result involves only one of the distributions
while the other involves a convolution of two factors can also be understood in a simple way
from the previous considerations. The enforcement of the correlation limit is schematically
the same as singling out one hard scattering in the process and then taking $u = u' = 0$
for the rest of the interactions. When the hard scattering occurs in the lower part of the
diagram in Fig. 10, the quark-antiquark pair in the upper part does not interact, by color
transparency ($S^{(4)}_{x_g}(b, b; b', b') = 1$), and therefore there is no trace of it in the first term of the
factorized expression. When the hard scattering occurs in the upper part of the diagram in
Fig. 10, the quark in the lower part still interacts with the nucleus (and exchanges transverse
momentum) and therefore has to be included in the form of a dipole cross section.



2. $g \to q\bar{q}$



Following the same strategy from previous sections, we start with the partonic level 
formula for the cross section built from the splitting wave function and the multiple scattering 
factor. In this particular case it takes the following form, 



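$$
\frac{d\sigma^{gA\to q\bar{q}X}}{d^3k_1\,d^3k_2} = \alpha_S\,\delta(p^+ - k_1^+ - k_2^+)
\int \frac{d^2x_1}{(2\pi)^2}\frac{d^2x_1'}{(2\pi)^2}\frac{d^2x_2}{(2\pi)^2}\frac{d^2x_2'}{(2\pi)^2}\,
e^{-ik_{1\perp}\cdot(x_1-x_1')}\,e^{-ik_{2\perp}\cdot(x_2-x_2')}
\sum_{\lambda\alpha\beta}\psi^{T\lambda*}_{\alpha\beta}(x_1'-x_2')\,\psi^{T\lambda}_{\alpha\beta}(x_1-x_2)
$$
$$
\times\Big[ C_{x_g}(x_1,x_2,x_1',x_2') + S^{A}_{x_g}\big(zx_1+(1-z)x_2,\, zx_1'+(1-z)x_2'\big)
- S^{(3)}_{x_g}\big(x_1,\, zx_1'+(1-z)x_2',\, x_2\big) - S^{(3)}_{x_g}\big(x_2',\, zx_1+(1-z)x_2,\, x_1'\big)\Big], \tag{88}
$$

where $S^{(3)}_{x_g}$ is given by (82) and

$$
C_{x_g}(x_1,x_2,x_1',x_2') = \frac{1}{C_FN_c}\left\langle \mathrm{Tr}\left(U^\dagger(x_2)\,T^c\,U(x_1)\,U^\dagger(x_1')\,T^c\,U(x_2')\right)\right\rangle_{x_g}, \tag{89}
$$
$$
S^{A}_{x_g}(v,v') = \frac{1}{N_c^2-1}\left\langle \mathrm{Tr}\,W(v)\,W^\dagger(v')\right\rangle_{x_g}, \tag{90}
$$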



and the splitting wave function is the same as in the DIS case with $Q^2 = 0$. Notice that this
cross section is down by a factor of $N_c$ as compared to the $q \to qg$ case. This is due to the
averaging over the incoming particle, which amounts to a factor of $1/(N_c^2-1)$ for gluons instead
of the factor of $1/N_c$ for quarks.

All the correlators above have been previously studied in the literature and explicit
expressions for a Gaussian distribution of charges have been found. The only new ingredient
that has not been considered in previous sections is $C_{x_g}$, which was thoroughly studied in
[40]. Following the procedure from the previous section, let us express the correlators defined
above in terms of fundamental Wilson lines only, by means of Fierz identities.

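$$
C_{x_g}(x_1,x_2,x_1',x_2') = \frac{1}{N_c^2-1}\left\langle \mathrm{Tr}\,U(x_1)U^\dagger(x_1')\,\mathrm{Tr}\,U(x_2')U^\dagger(x_2)
- \frac{1}{N_c}\mathrm{Tr}\,U(x_1)U^\dagger(x_1')U(x_2')U^\dagger(x_2)\right\rangle_{x_g}, \tag{91}
$$
$$
S^{A}_{x_g}(v,v') = \frac{1}{N_c^2-1}\left\langle \mathrm{Tr}\,U(v)U^\dagger(v')\,\mathrm{Tr}\,U(v')U^\dagger(v) - 1\right\rangle_{x_g}. \tag{92}
$$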
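The Fierz step can be checked numerically. The sketch below is an illustration rather than part of the derivation: it assumes $N_c = 3$ with generators $T^a = \lambda^a/2$ built from the Gell-Mann matrices, and the helper names (`fierz_violation`, `trace_identity_gap`) are hypothetical. It verifies the identity $T^a_{ij}T^a_{kl} = \frac{1}{2}\delta_{il}\delta_{jk} - \frac{1}{2N_c}\delta_{ij}\delta_{kl}$ and the trace rearrangement $\mathrm{Tr}(A\,T^c B\,T^c C) = \frac{1}{2}\mathrm{Tr}B\,\mathrm{Tr}(CA) - \frac{1}{2N_c}\mathrm{Tr}(ABC)$ that relates a correlator of the form (89) to one of the form (91).

```python
import math
import random

NC = 3
_S = 1.0 / math.sqrt(3.0)
# Gell-Mann matrices; generators are T^a = lambda^a / 2
# (assumed normalisation Tr T^a T^b = delta^ab / 2).
GELL_MANN = [
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
    [[0, -1j, 0], [1j, 0, 0], [0, 0, 0]],
    [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
    [[0, 0, -1j], [0, 0, 0], [1j, 0, 0]],
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
    [[0, 0, 0], [0, 0, -1j], [0, 1j, 0]],
    [[_S, 0, 0], [0, _S, 0], [0, 0, -2 * _S]],
]
GENERATORS = [[[0.5 * e for e in row] for row in lam] for lam in GELL_MANN]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def trace(a):
    return sum(a[i][i] for i in range(len(a)))


def fierz_violation():
    """Largest deviation from the Fierz identity over all index choices."""
    worst = 0.0
    r = range(NC)
    for i in r:
        for j in r:
            for k in r:
                for l in r:
                    lhs = sum(t[i][j] * t[k][l] for t in GENERATORS)
                    rhs = 0.5 * (i == l) * (j == k) - (i == j) * (k == l) / (2 * NC)
                    worst = max(worst, abs(lhs - rhs))
    return worst


def random_matrix(rng):
    return [[complex(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(NC)]
            for _ in range(NC)]


def trace_identity_gap(seed=0):
    """|sum_c Tr(A T^c B T^c C) - (Tr B Tr(CA)/2 - Tr(ABC)/(2 Nc))| for random A, B, C."""
    rng = random.Random(seed)
    a, b, c = (random_matrix(rng) for _ in range(3))
    lhs = sum(trace(matmul(matmul(matmul(matmul(a, t), b), t), c))
              for t in GENERATORS)
    rhs = 0.5 * trace(b) * trace(matmul(c, a)) - trace(matmul(matmul(a, b), c)) / (2 * NC)
    return abs(lhs - rhs)
```

The trace identity is purely algebraic, so it holds for arbitrary matrices; for the correlator above one would take $A = U^\dagger(x_2)$, $B = U(x_1)U^\dagger(x_1')$ and $C = U(x_2')$.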

We take the large- N c limit in order to be able to compare with the results from the previous 
section and relate the cross section to the gluon distributions defined before. Under this 
approximation, the correlators above can be expressed entirely in terms of 2-point functions. 

$$
C_{x_g}(x_1,x_2,x_1',x_2') \simeq S^{(2)}_{x_g}(x_1,x_1')\,S^{(2)}_{x_g}(x_2',x_2), \tag{93}
$$
$$
S^{A}_{x_g}(v,v') \simeq S^{(2)}_{x_g}(v,v')\,S^{(2)}_{x_g}(v',v). \tag{94}
$$
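The step from the adjoint trace to fundamental traces, and the size of the term dropped at large $N_c$, can be illustrated for a single unitary matrix $V$ standing in for $U(v)U^\dagger(v')$. The sketch below is only a configuration-by-configuration check for $N_c = 2$ (chosen because SU(2) matrices are easy to parametrise); it does not test the factorization of the ensemble average itself. The adjoint matrix is built from the standard relation $W^{ab} = 2\,\mathrm{Tr}(T^a V T^b V^\dagger)$, which is assumed here, and all function names are hypothetical.

```python
import cmath
import math

PAULI = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def dagger(a):
    return [[a[j][i].conjugate() for j in range(2)] for i in range(2)]


def trace(a):
    return a[0][0] + a[1][1]


def su2_matrix(theta, phi, chi):
    """SU(2) element [[a, b], [-conj(b), conj(a)]] with |a|^2 + |b|^2 = 1."""
    a = cmath.exp(1j * phi) * math.cos(theta)
    b = cmath.exp(1j * chi) * math.sin(theta)
    return [[a, b], [-b.conjugate(), a.conjugate()]]


def adjoint_trace(v):
    """Tr W with W^{ab} = 2 Tr(T^a V T^b V^dagger), T^a = sigma^a / 2."""
    vd = dagger(v)
    total = 0.0
    for s in PAULI:
        total += 0.5 * trace(matmul(matmul(matmul(s, v), s), vd))
    return total


def adjoint_dipole(v, nc=2):
    """Single-configuration analogue of the adjoint dipole: (|Tr V|^2 - 1)/(Nc^2 - 1)."""
    return (abs(trace(v)) ** 2 - 1.0) / (nc * nc - 1)


def fundamental_product(v, nc=2):
    """Single-configuration analogue of the product of two fundamental dipoles."""
    return abs(trace(v) / nc) ** 2
```

For any configuration the two quantities differ by `(fundamental_product - 1) / (Nc**2 - 1)`, which makes the $1/N_c^2$ suppression of the correction explicit.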

This way of factorizing the correlators and the fact that the 4-point function is absent
suggest that this process is related to the distribution given by the Fourier transform of the
dipole cross section only. With this consideration in mind, we Fourier transform all the $S^{(2)}_{x_g}$
factors and perform the usual change of variables $u = x_1 - x_2$ and $v = zx_1 + (1 - z)x_2$ (and
similarly for the primed coordinates) and obtain

da9 A->qqX i f d 2 u d 2 u , ^ 



d 3 kid 3 k 2 



xe -i(q±-qi-<]2)-(v-v') 



x 



e -iP±iu-u')J2^( U ')^(u) 
e i((l-z)q 2 -zq 1 )-(u-u') _ e i((l-z)q 2 -zq 1 )-u _ e -i((l-z)q 2 -zq 1 )-u' _|_ ^ 



(95) 
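The change of variables used here has unit Jacobian and reorganises the Fourier phases into a total-momentum part conjugate to $v$ and a relative-momentum part conjugate to $u$. The following sketch checks this with random numbers; the function names are hypothetical, and the labels `q = k1 + k2` and `p = (1 - z) k1 - z k2` are our naming for the two momentum combinations that the algebra produces.

```python
import random


def to_uv(x1, x2, z):
    """u = x1 - x2, v = z x1 + (1 - z) x2 for 2D transverse vectors."""
    u = (x1[0] - x2[0], x1[1] - x2[1])
    v = (z * x1[0] + (1 - z) * x2[0], z * x1[1] + (1 - z) * x2[1])
    return u, v


def from_uv(u, v, z):
    """Inverse map: x1 = v + (1 - z) u, x2 = v - z u."""
    x1 = (v[0] + (1 - z) * u[0], v[1] + (1 - z) * u[1])
    x2 = (v[0] - z * u[0], v[1] - z * u[1])
    return x1, x2


def dot(a, b):
    return a[0] * b[0] + a[1] * b[1]


def phase_gap(k1, k2, x1, x2, z):
    """Difference between k1.x1 + k2.x2 and q.v + p.u (should vanish)."""
    u, v = to_uv(x1, x2, z)
    q = (k1[0] + k2[0], k1[1] + k2[1])
    p = ((1 - z) * k1[0] - z * k2[0], (1 - z) * k1[1] - z * k2[1])
    return dot(k1, x1) + dot(k2, x2) - dot(q, v) - dot(p, u)


def jacobian(z):
    """det d(u, v)/d(x1, x2) per transverse component: rows (1, -1) and (z, 1 - z)."""
    return 1 * (1 - z) - (-1) * z
```

Because the Jacobian equals one for every $z$, the measure $d^2x_1\,d^2x_2$ can be replaced by $d^2u\,d^2v$ without extra factors.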



As in the photon emission case, the $u$ and $u'$ integrations reduce to calculating the Fourier
transform of the splitting wave function with different momentum variables for each of the
terms. The $v$ and $v'$ integrations give a $\delta$-function relating the momentum variables of the
two distributions and a factor of the total transverse area. As in previous cases, we use
collinear factorization for the incoming parton from the proton projectile and obtain

da pA ^ q Q x S i 

x P gf(x p )a s [z 2 + (1 - z) 2 ] z(l - z)- 



dV.S. v ° L 'V / j ~V > ( 27r ) 5 



X 



/ d W (2) (^ - ?i - q2 )F Xg ( gi )F Xg ( q2 ) l Zqi V f 2)2 (96) 
J P 2 (P ± + z qi -(l-z)q 2 ) 2 



In the correlation limit, the denominator of the last fraction above becomes just $P_\perp^4$. From
this expression it is clear that the distributions involved will be written as a convolution of
two factors involving the Fourier transform of the dipole cross section. To see how the
equation above agrees with the factorized form in (65), expand the numerator and write the
momentum factors as derivatives with respect to transverse coordinates of the dipole cross
sections inside the definition of $F_{x_g}$, as was explained for the case of photon emission. There
is a subtlety concerning the relative signs of the different terms when this identification is
made. In order to find complete agreement between the formula above and the factorized
formula from the TMD formalism, it is necessary to write the two $F_{x_g}$ factors as Fourier
transforms of Wilson loops in opposite directions (one of them in terms of $U^{[\square]}$ and the other
in terms of $U^{[\square]\dagger}$). Because of this, $q_1$ and $q_2$ enter with opposite signs when expressed as
derivatives of the Wilson loops. This sign is not visible in the terms with $q_1^2$ or $q_2^2$, but it
changes the sign of the cross term, giving complete agreement with the factorized expression.

FIG. 11. Above: graphical representation of the splitting in the amplitude and conjugate amplitude.
Below: splitting in the large-$N_c$ limit.

As done for the previous channel, let us consider the graphical representation of this
channel in the large-$N_c$ limit in Fig. 11. After replacing the gluon line with a quark-antiquark
pair we are left with two independent fermion lines which scatter separately off the nucleus.
Each of them resembles the photon emission case and therefore we expect, even before
performing the calculation, to obtain a convolution of two Fourier transforms of the dipole
cross section. In the correlation limit, the two terms in (65) have a simple explanation in
terms of a hard scattering. The first term accounts for the cases where the hard scattering
involves only one of the two quark lines, while the second term is an interference term in which
the large momentum transfer involves the two participants.



This channel had already been considered in [40] where, due to the choice of gauge, the
separation of the amplitude into splitting function and multiple scattering terms is
not visible. It is possible to show that our expressions above are consistent with their results
when expressed in the same set of coordinates and momentum variables.



3. $g \to gg$

In order to study the partonic process $g \to gg$, we need to derive the splitting function
first. It can be written in momentum space as

RL.) = ; 1 ~, , (97)
where $V_{g\to gg}$ is just the three-gluon vertex with the coupling and color factor factorized out.
This can be written as 



v g^gg ~ t a t b t c 



9aP (Pa ~ Pb) 1 + 9M (Pb - Pc) a + 9ja (Pc - Pa), 



(98) 



Here $\epsilon_i$ represents the polarization vector for gluon $i$ with four-momentum $p_i$. It is
straightforward to find that

,2 8pi 



5>< 



9^99 1 



spin 



z{\-z) 



+ 



+ z(l-z) 



(99) 



After summing over all polarizations, the squared splitting function in transverse coordi- 
nate space reads 



P 



l-z 



1 - z /-, \ 
+ + z(l - z) 



u ■ u 

u' 2 u 2 



(100) 



Now let us turn our attention to the multiple scattering terms. Since all the particles 
involved in the process are gluons, all terms contain only Wilson lines in the adjoint rep- 
resentation. In the following we give the explicit forms of the scattering terms with their 
respective large- N c expressions in terms of fundamental Wilson lines. 



fade [W(x 1 )W^(x' 1 )f [W{x 2 )W\x' 2 )] ec f abc ) ~ (Trll^x (x'J) x (TrU (x 2 )U^x' 2 )) 



X 



< TrU(x 1 ) U ] (x[ ) U(x' 2 ) [/ f (x 2 ) ) 



(101) 



(f ade W db (x 1 )W ec (x 2 )f fbc W fa (v , )) Xg * {TrU^( Xl )U(v')) Xg (TrU(x 2 )tf (v')) Xg 

x (TrU( Xl )U\x 2 )) Xg , 
(f ade W db (x[)W ec (x' 2 )f fbc Wf a (v)) Xg ~ (Trtf(v)U(x[)) Xg (TrU(v)tf \x' 2 )) Xg 

x (Tr^^Vi)}^, 
N c (TrW(v)W\v')) ~ N c (TrU*(v)U(v')) (TrU(v)U*(v')) 



(102) 

(103) 
, (104) 

Xg \ / Xg \ I Xg 

The correlation limit is applied by following the procedure developed in the DIS case and
reproduced in the $q \to qg$ channel. By inspection of the multiple scattering terms above,
it is easy to see that the same kind of cancellations occur for this channel. Since the lowest
order terms left after the various cancellations come from the first of the scattering terms,
it is easy to see that the final result will involve combinations of one WW distribution and
two Fourier transforms of the dipole cross section. After some algebra we arrive at



dvs = x p g(x p )64n 3 ^z(l - z) 



+ 



+ z(l-z) 



d 3 v d 3 v' 
(27r) 3 (27r) 3< 



D -iq ± -(v-v') 



(z 2 + (1 - zf) A- (TrU(v)tf(v')) Xg <Tr [F^ (v)U^ F^ (if)U^] ) Xg 
+ ^U{v)U\v')) Xg ±- (TrU(v')U\v)) Xg <Tr [F^ {v)U^ {v>)U^]) 3 



+2*(1 - z)±- (W{v)U{v)U\v')) Xa ±r (TrF^ (v')U (v')U^v)) Xo 



C <3 N r 



(105)

which is straightforward to compare to the factorized expression in (75).

FIG. 12. Graphical representation of the splitting $g \to gg$ in the large-$N_c$ limit, in the amplitude
and the conjugate amplitude.

This structure could have been anticipated by looking at the graphical representation of
this process in the large-$N_c$ limit shown in Fig. 12. In terms of the hard scattering picture
used in previous sections, the structure of the expression above is consistent with previous
results. The first and third terms look exactly the same as the two terms in the $g \to q\bar{q}$ case
and they correspond to the case in which the hard scattering does not involve the inner loop
in Fig. 12. The second term corresponds to the case where the hard scattering occurs in
the inner loop. It has the same structure as one of the terms found in the $q \to qg$ case, with
an additional convolution associated with the extra quark line at the top of the diagram.

V. CONCLUSION 

In this paper, we have studied and established an effective $k_t$-factorization for dijet
production at small-$x$ in dilute-dense collisions. Although $k_t$-dependent parton distributions
are different in different processes, they can be calculated and related to each other. We
found that there are two fundamental unintegrated gluon distributions, namely, $xG^{(1)}$ and
$xG^{(2)}$, at small $x$. Although other gluon distributions appear in many different
dijet production processes, one can compute them and find that they are related to these
two fundamental gluon distributions in the large-$N_c$ limit. In terms of the CGC framework,
this means that the two- and four-point functions are enough to compute any dijet cross
section in the small momentum imbalance limit. In addition, there is a similar conclusion for
the quark distributions at small $x$ [3]. By doing so, we can restore the predictive power of
the theory.

Therefore, as part of the conclusion, we would like to summarize the empirical rules in
the large-$N_c$ limit$^{10}$ in this effective $k_t$-factorization for dilute-dense systems as follows:

• The cross section can be still separated into the products of the hard parts and parton 
distributions; 

• The hard factors should be calculated separately for each individual graph since the 
parton distribution associated with each graph may be different; 

10 Note that one does not need to take the large-$N_c$ limit in the calculation of $xG^{(1)}$ and $xG^{(2)}$ in DIS dijet
and photon-jet production in pA collisions, respectively. However, the large-$N_c$ limit is essential in order to eliminate
other non-universal distributions or correlators in the other dijet channels, i.e., $qg \to qg$, $gg \to q\bar{q}$
and $gg \to gg$ in pA collisions.

• By replacing gluon lines with double lines, transform the Feynman graphs of the hard part
into large-$N_c$ planar graphs. The planar graphs show that there are only two building
blocks, namely, quark lines and color singlet quark loops, in any graph (see e.g.,
Figs. 10, 11 and 12);

• Quark lines always interact with the dense target, which contributes $xG^{(2)}$ or
$F_{x_g}(q_\perp)$ to the gluon distribution. However, the color singlet quark loop may or may
not interact with the dense target due to its peculiar color structure. If the quark loop
participates in the interaction, it contributes $xG^{(1)}$ to the gluon distribution.

• If there are multiple objects involved in the soft interaction, the resulting gluon dis- 
tribution can be written in terms of convolutions of all contributions in momentum 
space. For quark distributions in the dense target, there are similar rules which can 
be found in Ref. [7].

Using the above rules and calculating the coefficients of gluon distributions as illustrated in
Ref. [2], it is then straightforward to write down cross sections in terms of products of hard
parts and parton distributions, as illustrated in this paper.

There have been ambiguities regarding the unintegrated gluon distributions for more
than a decade. In this paper, we resolve this decade-long puzzle through explicit operator
definitions and propose measurements in physical processes which probe the distributions
directly. In particular, we find that the quark-antiquark correlation in DIS can probe
the Weizsäcker-Williams gluon distribution formulated in the CGC many years ago.

It is well known that in the color dipole approach, the cross sections of inclusive DIS and
SIDIS at small-$x$ are proportional to the dipole cross section. Since the dipole cross
section can be written as the Fourier transform of the normalized gluon distribution $F_{x_g}(q_\perp)$,
we can study the unintegrated gluon distribution $xG^{(2)}$ of nuclei through inclusive DIS and
SIDIS at EIC. Moreover, using DIS dijet (dihadron) processes at EIC in the correlation
limit, we can directly probe the Weizsäcker-Williams gluon distribution $xG^{(1)}$, which is the
distribution that actually counts the number of gluons in the nuclear wave function. This
would give us a golden opportunity to access the saturated WW gluon distribution, which
has been studied for many years.

Recently, both the STAR and PHENIX Collaborations have published experimental results
on di-hadron correlations in dAu collisions, where a strong back-to-back de-correlation of
the two hadrons was found in the forward rapidity region of the deuteron [37]. These results
have stimulated a number of theoretical calculations in the CGC formalism, where different
assumptions have been made in the formulations [38, 39], though not the correlation limit
we had to use in the present study. In particular, the numerical evaluation in Refs. [36, 38]
only contains the first term in the qg channel in Eq. (B22). The second term, as well as
other missing terms due to the use of a Gaussian approximation [19], are equally important
and should be taken into account to interpret the STAR data. In addition, we also present
the first CGC calculations of the $g \to q\bar{q}$ and $g \to gg$ channels in pA collisions. Although
these channels are subdominant in forward dijet production, they are important in the
central rapidity region.

At RHIC and the LHC, by measuring the direct photon-jet correlation in pA collisions, one
can gain direct information about the dipole unintegrated gluon distribution $xG^{(2)}$.
Furthermore, by investigating dijet (quark-gluon or gluon-gluon jet) production in the
correlation limit, one can test the universality of gluon distributions, and begin to see the
convolutions of these two unintegrated gluon distributions. Using dijet production with more
general kinematics, one can probe even deeper into the small-$x$ QCD dynamics, as multi-gluon
distributions become crucial.

Appendix A: Derivation of two gluon distributions

1. The Weizsäcker-Williams gluon distribution

Let us focus on the Weizsäcker-Williams gluon distribution $xG^{(1)}(x, q_\perp)$ first. Here we
provide a derivation of this gluon distribution from its operator definition, together with the
gauge links for a large nucleus, by using the McLerran-Venugopalan model. According to
its definition

/AC A 2 C p —iq±-(,± — ixP + £ t - 
4 *; o , 3 p+ < Tr [F (0 U^F (0) W' + '] > , (Al) 
(Z7TJ r + 

with $U^{[+]} = U^n[0, +\infty; 0]\,U^n[+\infty, \xi^-; \xi_\perp]$. The definition above is gauge invariant. However,
in order to simplify the calculation, we have chosen to use covariant gauge. It is then
easy to write it as

f AC A 2 C p -iq±-(,±-ixP+£- 

xG^\x,q L ) = 2 / g= <Tr[^[+oo,e-;a]^(0^ nt [+oo,^;a] 

J (27T) r + 

x t/ nt [0,+oo;0]F(0)f/ n [0,+oo;0]]) 

d£-d 2 £ ± e- iq ^- ixP+ t- 

(2tt) 3 P+ 

; [W^[+oo, £ ± } ab F a (0 W n % +oo; 0} cb F c (0)] > , (A2) 



where $W^{n\dagger}[+\infty, \xi^-; \xi_\perp]_{ab}$ is now in the adjoint representation. Following Belitsky et al. [20],
we can insert a complete set of one-particle intermediate states. Notice that for the quark
distribution at small-$x$, we need two-particle intermediate states due to the antiquark
spectator. Therefore, the gluon distribution reads

r ac A 2 c . P -iq±<±-ixP + Z- r ^p' 
xG^(x, g± ) = J ^ d ( 27r)3p+ J ^yi(2^)5(P /2 -m 2 )4(0)A(O, (A3)
where we introduced the amplitude

A(0 = (P'\W nj [+oo,^;^} ab F a (0 \P) . 
This amplitude then takes the form 



A(0 



(2^) 4 
to tl 

xG (1 \x,q ± ) 



<Pk e^- k++ ' l ^(2n) 4 5 w (k + P' - P)A a {k). 



By plugging in everything into the definition, we find 

2 



(27r) d (2P^ 



Al(x,q±)A a {x,q ± ). 



(A4) 



(A5) 



(A6) 



Let us define A a {x , R±) as the Fourier transform of A a (x, q±). Like what we have done for 
the quark distributions Q, we can compute the diagrammatic contributions to A a (x_, R±) 
order by order and resum it in coordinate space which gives 



A a (x_, R ± ) = W ba {x-,R ± ) F b +i (R A 
where 



d 2 x ± p b x ± ) W ba R±) V R± G (R± - x ± ) 



W(x-,R\ 



Texp 



dz- I d 2 z±G (x± — z±) p c (z_, z±)t c 



(A7) 
(A8) 



with t c being the adjoint color matrix. Eventually, one should be able to write 



xG^'(x, q±) 



47T 3 



dz- I d R\ 



xAl{z-,R ± )A a {z'_,R' ± 



(A9) 



In arriving at the expression above, we have put in a normalization factor of $2P^+$ which
comes from $\langle P|\cdots|P\rangle$. In the following, we need to average the above expression with a
Gaussian distribution of color charges as proposed in the CGC. Therefore



xG {1 \x,q ± ] 



4ir 3 



d 2 R ± / d 2 R' , e M*x-«i)+^ + ( 



+00 



dz~d 2 z± 



+00 



dz' d 2 z 



x (p b {z~,z ± ) W ba (z~, R ± ) p c (z'-,z' ± ) Wl (z'-,R' x )) p 
xV fll G (R ± - z ± ) V R/± G (R' ± - 4) . 
Assuming a Gaussian distribution of sources, it is easy to prove that 
(p b {z-,z ± ) W ba R ± ) p c (z'_,z' ± ) Wl (z'_,R' ± ) ) p 
= (p b zj.) p c {z'_,z' x )) (W ba (z_, R ± ) Wl (z'_,R' ± ) ) p 
= 5 bc S (z_ - z'_) p 2 (z_,z ± - z' ± ) (W ba (*_, R ± ) Wl (z'_,R' ± ) ) p , 

by using the fact that $t^c_{ab} = -if_{abc}$ (note that $f_{abc}$ is antisymmetric; see [42] for more
details). Therefore, we have

xG {1) {x,q ± ] 



(A10) 



(All) 
(A12) 



^ J d 2 R ± J d^eM^-^x) J °° dz„d 2 z ± d 2 z' ± p 2 (z_,z ± -z' ± ) 
xTr (W (z_,R ± ) W^ (z„,R' x )) V R± G (R ± - z ± ) V R/± G (R' ± - z' ± ) . (A13)

The different factors in the integral above can all be written in terms of a single function
when a Gaussian distribution of sources is used. Let

$$
\Gamma(R_\perp - R_\perp') = g^4\int d^2z_\perp\,d^2z_\perp'\,dz^-\,\mu^2(z^-, z_\perp - z_\perp')\,
\left[G(R_\perp - z_\perp) - G(R_\perp' - z_\perp)\right]\left[G(z_\perp' - R_\perp) - G(z_\perp' - R_\perp')\right]. \tag{A14}
$$

In terms of $\Gamma$, Eq. (A13) takes the form

xG {1) {x,q ± ) 



167T 4 a s N r 



I - oxp ( — ^ r (r ± ) 



(A15) 



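In particular, for the McLerran-Venugopalan model, the function $\Gamma$ can be evaluated
explicitly, giving the well-known result

$$
xG^{(1)}(x, q_\perp) = \frac{S_\perp}{4\pi^4\alpha_s}\,\frac{N_c^2-1}{N_c}\int d^2r_\perp\,\frac{e^{-iq_\perp\cdot r_\perp}}{r_\perp^2}
\left[1 - \exp\left(-\frac{1}{4}r_\perp^2Q_s^2\right)\right], \tag{A16}
$$

where $Q_s^2$ is the gluon saturation scale.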
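For orientation, if $Q_s$ in (A16) is treated as independent of $r_\perp$ (an assumption made here for illustration; in the McLerran-Venugopalan model it carries a logarithmic dependence on $r_\perp$), the integral can be done in closed form: $\int d^2r_\perp\,e^{-iq_\perp\cdot r_\perp}\,r_\perp^{-2}\,[1 - e^{-r_\perp^2Q_s^2/4}] = \pi E_1(q_\perp^2/Q_s^2)$, with $E_1$ the exponential integral. The sketch below evaluates $E_1$ with Simpson's rule using only the standard library; the function names are hypothetical.

```python
import math


def exp_integral_e1(x, n=2000):
    """E_1(x) = int_0^1 exp(-x/t)/t dt for x > 0, composite Simpson rule (n even)."""
    if x <= 0:
        raise ValueError("x must be positive")
    h = 1.0 / n

    def f(t):
        # The integrand vanishes faster than any power as t -> 0.
        return math.exp(-x / t) / t if t > 0.0 else 0.0

    total = f(0.0) + f(1.0)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(i * h)
    return total * h / 3.0


def ww_shape(q2_over_qs2):
    """xG^(1) in units of S_perp (Nc^2 - 1) / (4 pi^3 alpha_s Nc), for fixed Q_s."""
    return exp_integral_e1(q2_over_qs2)
```

Under this assumption the distribution grows only logarithmically for $q_\perp^2 \ll Q_s^2$, since $E_1(x) \approx -\gamma_E - \ln x$ at small $x$, which is the familiar saturation behaviour of the Weizsäcker-Williams distribution.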



2. The dipole gluon distribution 

According to its definition 

/AC A 2 C „ — iq±-£,± — ixP + i;- 
^ { l )3p+ < Tr l F (0 u [ - ]] f (0) WW] ) 



x 



G (2 \x, 



d£_d 2 £ ± e- iq ^- ixP+ t- 
(2tt) 3 P+ 



(A17) 



with $U^{[+]} = U^n[0, +\infty; 0]\,U^n[+\infty, \xi^-; \xi_\perp]$ and $U^{[-]} = U^n[0, -\infty; 0]\,U^n[-\infty, \xi^-; \xi_\perp]$. Here we
have chosen the covariant gauge to do the calculation. Thus, one gets



(A18) 



x <Tr p7»[+oo,£_;£]_]F(0 ^[-oo,£_;£j 
£/ nt [0, -00; 0}F (0) c7 n [0, +00; 0]] ) . 

By inserting the intermediate state and replacing $\langle P|\cdots|P\rangle$ by the ensemble average
$\langle\cdots\rangle_\rho$, we get



2tt 3 



with 
and 



d 4 fc 



(A19) 
(A20) 



B(0 



y _|J_ e ^ + +^x (2 7 r) 4 ( 5( 4 ) (Jfe + P' - P) B(jfe) (A21) 



In CGC, we can find that, in covariant gauge, the only non-trivial field strength is F+ l (x±) 
_qi j r\ 2 y ± Q _ y ± } p a y ± y Therefore, if we write 



$$
B(q) = \int dx^-\,d^2R_\perp\,e^{iq_\perp\cdot R_\perp}\,B(x^-, R_\perp) \tag{A22}
$$
$$
= \int dx^-\,d^2R_\perp\,e^{iq_\perp\cdot R_\perp}\,U^n[+\infty, x^-; R_\perp]\,F(R_\perp)\,U^{n\dagger}[-\infty, x^-; R_\perp], \tag{A23}
$$

we can easily see that $B(q) \propto \partial^i U^{n\dagger}[-\infty, +\infty; R_\perp]$. In the above derivation, we have
assumed that $e^{ix^-k^+} \simeq 1$. This can be justified by noting that $x^-$ is integrated from $-L$ to
$+L$, with $L$ being the longitudinal width of the nucleus. For small $k^+ = xP^+$ with small $x$,
we have $k^+L \ll 1$.
Eventually, one gets 



x- 



G^{x, 



Nc_ 
2tt 3 

qjNo 

2n 2 a, " " 



d 2 R< d 2 R\e iq ^- R 



r\V R± -V R > 1 



N, 



Tr [(U(R ± )UHR' ± )) P ] 

(A24) 



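with

$$
F_{x_g}(q_\perp) = \int\frac{d^2r_\perp}{(2\pi)^2}\,e^{-iq_\perp\cdot r_\perp}\,\frac{1}{N_c}\left\langle\mathrm{Tr}\,U(r_\perp)U^\dagger(0)\right\rangle_\rho
\simeq \frac{1}{\pi Q_{sq}^2}\exp\left[-\frac{q_\perp^2}{Q_{sq}^2}\right], \tag{A25}
$$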



where Q 2 q = ^ln^^ and jj 2 = \Cp Jdx /j 2 (x ). Here the saturation scale Q 2 is 

obtained from the fundamental representation and it is usually called quark saturation 
momentum. 
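The Gaussian form of $F_{x_g}$ follows from a dipole amplitude of the form $\exp(-Q_{sq}^2r_\perp^2/4)$ when the logarithmic $r_\perp$-dependence of $Q_{sq}$ is neglected. The sketch below, an illustration under that assumption with hypothetical function names, performs the two-dimensional Fourier transform numerically (it factorises into two one-dimensional integrals) and compares it with $e^{-q_\perp^2/Q_{sq}^2}/(\pi Q_{sq}^2)$.

```python
import math


def gaussian_ft_1d(q, qs2, n=4000):
    """int dx cos(q x) exp(-qs2 x^2 / 4), trapezoidal rule on [-L, L].

    The sine part of exp(-i q x) integrates to zero by symmetry.
    """
    half_width = 14.0 / math.sqrt(qs2)  # integrand is ~exp(-49) at the edges
    h = 2.0 * half_width / n
    total = 0.0
    for i in range(n + 1):
        x = -half_width + i * h
        weight = 0.5 if i in (0, n) else 1.0
        total += weight * math.cos(q * x) * math.exp(-0.25 * qs2 * x * x)
    return total * h


def dipole_ft_numeric(qx, qy, qs2):
    """int d^2r/(2 pi)^2 exp(-i q.r) exp(-qs2 r^2 / 4), done numerically."""
    return gaussian_ft_1d(qx, qs2) * gaussian_ft_1d(qy, qs2) / (2.0 * math.pi) ** 2


def dipole_ft_closed(qx, qy, qs2):
    """Closed form exp(-q^2 / qs2) / (pi qs2)."""
    return math.exp(-(qx * qx + qy * qy) / qs2) / (math.pi * qs2)
```

With this normalisation the distribution integrates to one over $d^2q_\perp$, consistent with the dipole amplitude being unity at zero separation.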

Appendix B: Evaluations of Correlators 

In this appendix, we summarize the evaluation of the CGC correlators used in the main
text of the paper.



1. The evaluation of two-point functions and $\langle\mathrm{Tr}\,U^{[\square]}\rangle$



First of all, as derived in Ref. [30], for an arbitrary single Wilson line (starting at $a^-$ and
ending at $b^-$), one can get



g A C, <■'> 



dz fi 2 (z ) / d 2 z±G 2 (x± — z±) 



(U [a ,b \x±)) = exp 



Using this result, it is easy to derive that

$$
\langle U(a^-, b^-|x_\perp)\,U(b^-, c^-|x_\perp)\rangle = \langle U(a^-, c^-|x_\perp)\rangle.
$$

The derivation is based on time ordering in $z^-$ and pairing of two adjacent operators.
Furthermore, for two infinite Wilson lines at different transverse positions, one gets



(Bl) 



(B2) 



(U(x ± )tf ( yi _)) =exp 



g*c F 



exp 



g'c 



16/T 



x± - y±) In 



dz fi 2 (z ) / d 2 z± (G(x± — z±) — G(y± — z±))' 
1 



A 2 (x± - y ± ) 



dz fi (z ) 



(B3) 



where 1/A stands for the cut-off in the integral. Thus, usually one writes {U (x±) W (y±)) 



exp 



Qi(x±-y±f 



with Q 2 = In Wt 1 ra f + °° dz'fi 

4tt X 2 (x ± -y ± Y J -00 r"

Now we are ready to evaluate $\langle\mathrm{Tr}\,U^{[\square]}\rangle$ and show that it is the same as the correlator of
two infinite Wilson lines. According to the definition, one can easily find that

$$
\langle\mathrm{Tr}\,U^{[\square]}\rangle = \left\langle\mathrm{Tr}\left[U(0, +\infty|0_\perp)\,U(+\infty, \xi^-|\xi_\perp)\,U(\xi^-, -\infty|\xi_\perp)\,U(-\infty, 0|0_\perp)\right]\right\rangle \tag{B4}
$$
$$
= \left\langle\mathrm{Tr}\left[U(0_\perp)\,U^\dagger(\xi_\perp)\right]\right\rangle, \tag{B5}
$$

where we have used Eq. (B2). Therefore, one can easily relate $\frac{1}{N_c}\langle\mathrm{Tr}\,U^{[\square]}\rangle$ to $F_{x_g}(q_\perp)$ through
a Fourier transform.



2. Evaluation of the 4-point function with a Gaussian distribution of sources 

The derivation presented here follows closely the method presented in [40] and used also
in [31], where other 4-point functions have been calculated. For the sake of completeness and
to make the presentation self-contained, we will briefly review how to calculate the 4-point
function $S^{(4)}_{x_g}(x_1, x_2; x_2', x_1') = \frac{1}{N_c}\left\langle\mathrm{Tr}\,U(x_1)U^\dagger(x_1')U(x_2')U^\dagger(x_2)\right\rangle_{x_g}$ for a Gaussian distribution
of charges. Details of the general procedure are given in [40].

The nuclear average of a function of the gauge field is defined by 

(f[A]) Xg = j ^pexp|-| d 2 xd 2 ydz+ ^^^If y) ]fl A ^ (B6) 

where the color charge $\rho$ and the gauge field are related by

$$
-\nabla_\perp^2 A_c^-(z^+, x) = g_s\,\rho_c(z^+, x), \tag{B7}
$$

and $\mu^2_{x_g}(z^+)$ is the density of color charges at a given $z^+$. This Gaussian distribution allows
us to express any correlator in terms of the elementary correlator of two color charges

$$
\langle\rho_c(z^+, x)\,\rho_d(z'^+, y)\rangle_{x_g} = \delta_{cd}\,\delta(z^+ - z'^+)\,\delta^{(2)}(x - y)\,\mu^2_{x_g}(z^+). \tag{B8}
$$

In order to do this, the Wilson lines must be expanded in powers of $g_s\rho$ and Wick's theorem
applied. The Wilson lines are naturally expanded in terms of the gauge field and
not the color charge density; therefore it is useful to express the elementary correlator (B8)
in terms of the gauge field,

$$
g_s^2\langle A_c^-(z^+, x)\,A_d^-(z'^+, y)\rangle_{x_g} = \delta_{cd}\,\delta(z^+ - z'^+)\,\mu^2_{x_g}(z^+)\,L_{xy}, \tag{B9}
$$

with $L_{xy}$ given in terms of the two-dimensional massless propagator $G_0$,

$$
L_{xy} = g_s^4\int d^2z\,G_0(x - z)\,G_0(y - z), \qquad G_0(x) = \int\frac{d^2k}{(2\pi)^2}\,\frac{e^{ik\cdot x}}{k^2}. \tag{B10}
$$



This correlation between two fields has the color structure of a gluon link. This, together
with the locality of the correlator in the $z^+$ variable, allows for a graphical representation
of each of the terms of the expansion of the Wilson lines. For the particular color structure
of the 4-point function we are interested in diagrams which look like the left-hand side
of Fig. 13. One kind of contribution from these diagrams that is easy to evaluate is the
contribution from links with both ends attached to the same line. For a Wilson line at a
transverse coordinate $x$ this kind of link gives a factor of $-C_F\mu^2L_{xx}/2$; it has a color singlet
structure and therefore factors out in the evaluation of any specific diagram.

FIG. 13. Graphical representation of the terms in the series expansion of the 4-point function.

When multiple insertions of these particular links are taken into account they can be resummed
into the factor



(Bll) 



where the contributions from the four Wilson lines involved in the correlator have been
included. After factoring out the so-called tadpole contributions we are left with diagrams
in which all gluon links connect different Wilson lines. The strategy to evaluate these diagrams
is to include the proper $L$ factors for each of the gluon links and to resolve the color structure
by means of the Fierz identity $T^a_{ij}T^a_{kl} = \frac{1}{2}\delta_{il}\delta_{jk} - \frac{1}{2N_c}\delta_{ij}\delta_{kl}$. The resummation is then not a
trivial task, since each diagram will end up being written as a linear combination of the two
topologies shown on the right-hand side of Fig. 13. As shown in [40], this difficulty can be
overcome by grouping diagrams according to the number of gluon links and then using an
inductive procedure to find the value of the $n$th term in the series. By explicitly resolving
the $n$th link and using the notation in Fig. 13 we have,



'n-1 



(B12) 



where the matrix M is given by all the different ways in which the nth link can be attached, 

M = ( Cf ^ Lx ^ + Lx 2 x + 2fc F ( x i' x 2; x' 2 , x[) -\F{x x , x[; x' 2 , x 2 ) 

y -\F{xi, x 2 ; x' 2 , x\) C F {L Xxx ^ + L X > 2X2 ) + ^F(x h x[; x' 2 , x 2 ) 

(B13) 

with $F(x_1, x_2; x_2', x_1') = L_{x_1x_2'} - L_{x_1x_1'} + L_{x_2x_1'} - L_{x_2x_2'}$. It is easy to solve this recursion relation
taking into account the initial condition $a_0 = 1$, $b_0 = 0$. Formally, the solution reads



(B14) 



In order to find an explicit solution we have to find the eigenvalues $\lambda_\pm$ and eigenvectors of
$M$. The solution then takes the form



i=i 



a+\\ + a_A™ 
b + \ n + + 6_A'! 



(B15)
with

A± = 

a± -- 



("T ~ 2lv~) <yLxiX2 + L ^i) + If ( L *i4 + + ^ F ( x uX2;x' 2 ,x' 1 ) ± ^v 7 ^ , 

(B16) 



A ±F(x 1 ,x 2 ;x 2 ,x' 1 ) 
2y/A 



F{X\i x 2 , x 2 , x^, 



(B17) 
(B18) 



With this result we can now easily resum the contribution from all the diagrams. The 
4-point function is then given by 



S Xg ( x li x 2: x 2> X \j 



T 



[N c a n (z+, 4) + NX(4, ■ • • , 4)] ■ (B19) 



n=0 J z i <—< z n 

When written in terms of the eigenvalues above this expression can be resummed into 

T 



^x]{ X li x 2', x 2i x l) 



N c (a+e M A+ + a_e 



N'i [ b+e" A + + 6_e 



(B20) 



Using the explicit values shown above, this expression takes the form 



&Xg (^1) x 2] x 2i x \> 



-^tTCxi-xaJ+r^-xi)] 



X 



+ 



A + F(xi,£' 2 ;£ 2 ,3a) F(ar 1 ,x 2 ;a:' 2 ,ar' 1 ) 



2v/A 



A - F(x l7 x' 2 ; x 2 ,x[) F(xi, x 2 ; x 2 , x[) 



x e -^^ 2 F(x 1 ,x' 2 ;X2,x' 1 )+j^^ 2 F{x 1 ,x 2 ;x , 2 ,x' 1 ) 



(B21) 



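with $\Gamma(x - y) = \mu^2(L_{xx} + L_{yy} - 2L_{xy})$.

Taking the large-$N_c$ limit of this result we find a much simpler expression,

$$
S^{(4)}_{x_g}(x_1, x_2; x_2', x_1') \simeq e^{-\frac{C_F}{2}\left[\Gamma(x_1 - x_2) + \Gamma(x_2' - x_1')\right]}
- \frac{F(x_1, x_2; x_2', x_1')}{F(x_1, x_2'; x_2, x_1')}
\left(e^{-\frac{C_F}{2}\left[\Gamma(x_1 - x_2) + \Gamma(x_2' - x_1')\right]} - e^{-\frac{C_F}{2}\left[\Gamma(x_1 - x_1') + \Gamma(x_2' - x_2)\right]}\right). \tag{B22}
$$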

Note that even this large-$N_c$ result cannot be expressed as a product of 2-point functions.
The two terms appearing above have a simple interpretation in terms of multiple scatterings
that allows us to label the first term as the elastic part and the second term as the
inelastic part (see [35], where the same scattering factor appears in the context of two-gluon
production). Taking into account that the Gaussian distribution of sources is equivalent
to the two-gluon exchange approximation with independent scattering centers, it is easy to
see that the first term is what one would expect if only interactions that do not break up
nucleons in the nucleus were allowed. In that scenario the quark-antiquark pair is always
in a singlet state and the interaction factor can be written as the product of the interaction
term for a dipole in the amplitude times an interaction term for the dipole in the conjugate
amplitude. The inelastic part takes into account all the interactions where at least one of the
nucleons is broken apart, in which case the quark-antiquark pair goes from a singlet state to
an octet state. In the large-$N_c$ limit transitions from the octet state to the singlet state are
suppressed and therefore the pair remains in the octet state for the rest of the interaction
with the nucleus.
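The structure of the large-$N_c$ 4-point function, an elastic term minus a ratio of $F$ combinations times the difference of two exponentials, can be explored numerically. The sketch below is an illustration under stated assumptions: it uses a hypothetical quadratic model $\Gamma(r) = r^2$ (units absorbed), the value $C_F = 1.5$ (the large-$N_c$ value $N_c/2$ at $N_c = 3$), and hypothetical function names. Since $F$ enters only through a ratio, it is written in terms of $\Gamma$ up to an overall factor. The code checks two limits: the 4-point function reduces to a single dipole when $x_1' = x_1$, and it equals one when $x_1 = x_2$ and $x_1' = x_2'$, which is the color transparency property.

```python
import math


def gamma_model(r):
    """Hypothetical model Gamma(r) = r^2 (an assumption for illustration only)."""
    return r[0] * r[0] + r[1] * r[1]


def sub(a, b):
    return (a[0] - b[0], a[1] - b[1])


def f_combination(a, b, c, d, gamma=gamma_model):
    """F(a, b; c, d) = L_ac - L_ad + L_bd - L_bc, up to an overall factor.

    Written with L_xy replaced by Gamma(x - y); the common factor cancels in the ratio.
    """
    return (gamma(sub(a, c)) - gamma(sub(a, d))
            + gamma(sub(b, d)) - gamma(sub(b, c)))


def dipole(x, y, cf=1.5, gamma=gamma_model):
    """2-point function exp(-C_F Gamma(x - y) / 2)."""
    return math.exp(-0.5 * cf * gamma(sub(x, y)))


def s4_large_nc(x1, x2, x2p, x1p, cf=1.5, gamma=gamma_model):
    """Large-Nc 4-point function S4(x1, x2; x2', x1') for the given Gamma."""
    elastic = math.exp(-0.5 * cf * (gamma(sub(x1, x2)) + gamma(sub(x2p, x1p))))
    other = math.exp(-0.5 * cf * (gamma(sub(x1, x1p)) + gamma(sub(x2p, x2))))
    ratio = (f_combination(x1, x2, x2p, x1p, gamma)
             / f_combination(x1, x2p, x2, x1p, gamma))
    return elastic - ratio * (elastic - other)
```

The ratio of $F$ combinations is undefined for configurations where the denominator vanishes, so a production evaluation would need to treat those points as limits; the sketch does not attempt this.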
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